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Progress  Report  for  the  DOD  Grant  DAMD  17-98-1-8042 

April  1999- April  2000  ( second  year  of  support) 


During  the  second  year  of  support  by  the  grant  DAMD  17-98- 1-8042  (the  Idea  Grant)  we 
completed  several  projects  described  last  year,  and  continued  with  several  other  projects. 

Signal-regulated,  cleavage-mediated  toxins 

During  1999  and  early  2000,  we  completed  the  work  on  the  first  example  of  a  cleavage- 
mediated  conditional  toxin.  A  draft  of  the  paper  about  this  class  of  conditional  toxins,  to  be 
submitted  for  publication  soon  (it  is  being  revised  as  I  write)  is  enclosed.  (Davydov  & 
Varshavsky.  Conditional  toxin  against  yeast  cells  expressing  HIV  protease.  J.  Biol.  Chem.  (to 
be  submitted  in  summer;  early  preprint  enclosed). 

Construction  of  degradation  signals  in  the  lysine-asparagine  sequence  space 

Among  the  problems  to  be  solved  on  the  way  to  multitarget  conditional  toxins  is  the 
paucity  of  strong,  portable  N-degrons  that  are  free  of  drawbacks  of  the  existing  degrons  of  this 
class.  These  drawbacks  include  the  tendency  of  proteins  bearing  such  degrons  to  undergo 
endoproteolytic  cleavages  that  sever  the  N-degron  from  the  rest  of  the  protein,  and  thereby 
contribute  to  the  high  background  of  assays  with  N-degron-bearing  proteins.  As  a  part  of  our 
ongoing  effort  to  construct  better  degradation  signals,  we  developed  a  new  approach,  which 
makes  it  possible  to  identify  and  isolate  specific  degradation  signals  in  the  sequence  space  of  just 
two  amino  acids,  lysine  and  asparagine  A  paper  describing  these  results  was  published  in  late 
1999.  (Suzuki  &  Varshavsky.  Degradation  signals  in  the  lysine-asparagine  sequence  space. 
EMBO  J.  18:6017-6026,  1999.)  (reprint  enclosed.! 

Bivalent  Inhibitor  of  Protein  Degradation  by  the  N-End  Rule  Pathway 

In  a  project  relevant  to  the  theme  of  this  grant,  we  have  developed  a  specific  bivalent 
inhibitor  of  the  N-end  rule  pathway.  The  relevance  of  this  advance  stems  from  the  fact  the 
bivalent  inhibitor  (described  below)  improved  the  understanding  of  the  targeting  (substrate¬ 
binding)  sites  of  N-recognin,  the  E3  component  of  the  ubiquitin-dependent  N-end  rule  pathway, 
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which  is  at  the  center  of  our  sitoxins  and  comtoxin  projects.  Eventually,  the  designs  of 
conditional  toxins  should  be  able  to  involve  any  of  the  several  ubiquitin-dependent  proteolytic 
pathways;  the  current  focus  on  the  N-end  rule  pathway  reflects  not  only  the  interest  of  this 
laboratory  but  also  the  fact  that  this  pathway  is  particularly  well  understood  mechanistically. 
Moreover,  in  staying  within  the  N-end  rule  pathway,  it  is  possible  to  use  the  portable,  highly 
active  N-degrons,  an  advantage  that  is  not  available  with  most  of  the  other  ubiquitin-dependent 
pathways.  A  paper  describing  these  results  was  published  in  1999.  (Kwon  &  Varshavsky. 
Bivalent  inhibitor  of  the  N-end  rule  pathway.  J.  Biol.  Chem.  274: 18135-18139,  1999.)  (reprint 
enclosed.) 

Some  of  the  Projects  in  Progress 

Construction  and  Use  of  the  Library  Expressing  Random  Fragments  of  the 
Green  Fluorescent  Protein  (GFP)  and  Src  Protein  Kinase 

The  aim  of  this  ongoing  project  is  to  develop  a  generally  applicable  method  for 
interfering  with  the  in  vivo  function  of  a  target  protein  of  interest  through  the  expression  of  a 
specific  fragment  of  the  target  that  would  bind  to  a  conformationally  immature  (nascent)  target 
protein  and  thereby  either  arrest  or  strongly  retard  its  conformational  maturation.  As  a  result,  the 
complex  of  the  target  and  its  self-peptide  would  be  nonfunctional  insofar  as  the  target’s  function 
is  concerned.  In  addition,  the  interfering  peptide  would  be  equipped  with  a  strong  N-terminal 
degradation  signal  (the  N-degron,  previously  studied  by  this  laboratory).  A  permanently  or 
transiently  stable  complex  between  a  target  protein  and  its  self-peptide  bearing  an  N-degron 
would  be  targeted  for  processive  degradation  by  the  N-end  rule  pathway,  one  of  the  pathways  of 
the  ubiquitin  system.  Once  developed,  this  method  could  be  applied  to  effect  selective  functional 
inhibition  of  undesirable  proteins  such  as,  for  example,  overexpressed  oncoproteins.  This 
trans-degradation  method  would  also  become  a  part  of  the  comtoxin  approach  described  in  the 
DOD  Idea  Grant  that  supports  these  studies.  To  develop  the  trans-degradation  technique,  we 
chose  Green  Fluorescent  Protein  (GFP)  as  a  model  target  protein.  (Our  next,  clinically  relevant, 
target  is  Src,  a  protein  tyrosine  kinase  and  an  proto-oncoprotein  activated  in  a  number  of  human 
cancers,  including  breast  cancer.)  We  carried  out  the  construction  of  a  library  of  clones  which 
expressed  polypeptide  fragments  derived  from  random  segments  of  GFP.  These  fragments  were 
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expressed  in  the  yeast  S.  cerevisiae  from  the  PGAL  promoter  as  linear  fusions  to  ubiquitin.  As 
shown  previously  by  this  laboratory,  the  N-terminal  ubiquitin  moiety  of  the  fusion  is 
cotranslationally  cleaved  off  by  ubiquitin-specific  proteases,  making  it  possible  to  expose  any 
desired  amino  acid  residue  at  the  N-terminus  of  the  resulting  polypeptide  fragment.  In 
developing  this  approach,  we  are  proceeding  in  steps,  at  first  constructing  a  ubiquitin  fusion- 
based  library  of  GFP  fragments  as  such.  The  second-generation  library  will  have  the  same 
fragments  bearing  the  40-residue  N-terminal  extension  called  Arg-eAK,  which  contains  the 
above-described  N-degron.  The  library  expressing  random  GFP  fragments  was  constructed 
using  DNase  I  in  the  presence  of  Mn2+  ions  to  fragment  the  GFP  cDNA,  and  specific 
oligonucleotide  adapters  to  insert  the  resulting  fragments  into  the  expression  vector.  More 
recently,  we  began  the  construction  of  analogous  libraries  with  v-Src  and  c-Src  trans-targeting, 
degron-containing  reporter.  The  v-Src  and  c-Src  cDNAs  were  obtained  from  Dr.  D.  Morgan 
(Univ.  of  California,  Berkeley).  They  were  tested  by  sequencing  the  5’-  and  3’-ends  (-200  bp 
each)  of  the  coding  regions,  and  confirming  that  the  resulting  sequences  matched  those  for  v-Src 
and  c-Src  in  the  GenBank  database.  Restriction  fragments  containing  the  Src  genes  were  isolated 
by  agarose  gel  electrophoresis,  and  are  being  used  to  construct  random-fragment  libraries  as 
described  above.  Taking  advantage  of  the  unique  Nco  I  site  at  the  5 ’-end  the  v-Src  cDNA,  the  v- 
Src  coding  region  (as  a  filled-in  Nco  I-Xho  I  fragment)  was  inserted  into  the  integration  vector 
pIP123-CUPR.  This  placed  the  Src  ORF  under  the  control  of  the  copper-inducible  Pcupl 
promoter.  It  remains  to  be  determined,  in  this  ongoing  project,  whether  this  promoter  is 
sufficiently  tight  (low  enough  level  of  Src  in  the  uninduced  state)  and  sufficiently  strong  (high 
enough  level  of  Src  in  the  induced  state  to  cause  growth  arrest).  (In  contrast  to  the  GFP-based 
library,  where  the  screen  is  being  carried  out  by  fluorescence-activated  cell  sorting  (FACS),  our 
readout  and  the  screen  with  Src  are  based  on  the  known  toxicity  of  mammalian  Src  to  the  yeast 
S.  cerevisiae.)  Examination  of  the  three-dimensional  structures  of  Src  family  tyrosine  kinases 
shows  that  a  loop  connecting  the  SH2  domain  to  the  kinase  domain  is  bound  to  the  SH3  domain 
of  Src  and  sandwiched  between  the  SH3  domain  and  the  N-terminal  half  of  the  kinase  when  Src 
is  in  the  inactive  state.  The  kinase-inhibiting  interaction  requires  phosphorylation  of  Src  at  its  C- 
terminal  region,  itself  bound  to  the  SH2  domain.  This  area  is  a  promising  candidate  for  a  trans¬ 
targeting  N-degron.  Our  aims  in  this  set  of  projects  are,  first,  to  optimize  the  library-based 
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interference  approach  described  above,  using  GFP  as  both  the  target  and  reporter,  then  to  expand 
this  approach  to  the  Src  kinase,  and  to  link  an  N-degron  to  the  interfering  fragments  identified  in 
both  the  GFP  and  Src  library  screen.  This  multistep  strategy  should  result  in  a  generally 
applicable  method  for  metabolically  destabilizing  (and  thereby  functionally  inhibiting)  any 
protein  of  interest,  including  the  targets  that  must  be  down-regulated  for  either  arresting  or 
selectively  killing  the  breast  adenocarcinoma  cells. 

Over  the  last  year,  this  project  ran  into  technical  difficulties,  but  we  are  committed  to  it, 
and  are  continuing  this  work. 

Designing  Asparagine-Lysine  Degrons  That  Work  in  Mammalian  Cells 

We  are  beginning  to  test  the  lysine-asparagine  N-degrons  of  S.  cerevisiae  identified  in  the 
preceding  work  supported  by  this  grant  (Suzuki  &  Varshavsky.  Degradation  signals  in  the 
lysine-asparagine  sequence  space.  EMBO  J.  18:6017-6026,  1999)  (reprint  enclosed!  in 
mammalian  cells.  Our  aim  is  to  determine  whether  any  of  the  highly  active  yeast  N-degrons  of 
this  class  that  have  been  identified  in  yeast  are  comparably  powerful  in  mammals.  If  they  are, 
several  applications,  including  those  that  encompass  conditional  toxins  will  become  possible 
right  away.  If  these  degrons  are  not  as  strong  in  the  mammalian-cell  setting,  we  will  modify 
them,  using  techniques  that  have  already  proved  successful  in  yeast,  to  produce  a  set  of  degrons 
that  would  confer  on  a  mammalian  protein  reporter  an  in  vivo  half-life  of  3  minutes  or  less.  With 
degrons  of  this  quality,  one  could  design  proteolysis-based  conditional  mutants  in  mice,  and 
conditional  toxins  as  well.  This  project  is  under  way. 


Enclosed  with  this  Progress  Report  are  the  recently  published  papers  (1999-2000;  three 
sets),  and  three  copies  of  the  Pi’s  CV.  The  Progress  Report  describes  projects  that  were 
supported,  either  partially  or  entirely,  by  the  DOD  Idea  Grant  during  its  second  year. 
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ABSTRACT 

A  cleavage-activated  conditional  toxin  is  an  engineered  protein  fusion,  whose 
toxicity  is  enhanced  as  a  result  of  proteolytic  cleavage  by  a  viral  protease  (PR).  A 
conditional  toxin  of  this  type  contains  (a)  a  cytotoxic  domain;  (b)  an  inactivating  domain; 
and  (c)  a  PR  cleavage  site  located  between  the  cytotoxic  and  the  inactivating  domains. 
The  presence  of  the  inactivating  domain  greatly  reduces  the  toxicity  of  the  cytotoxic 
domain.  Cleavage  of  the  fusion  by  PR  leads  to  the  release  of  the  toxic  domain,  resulting 
in  its  higher  toxicity.  We  constructed  a  cleavage-mediated  conditional  toxin  with 
enhanced  toxicity  against  yeast  cells  expressing  PR  of  the  human  immunodeficiency 
virus  (HIV  PR).  In  our  design  the  A  fragment  of  diphtheria  toxin  (DTA)  serves  as  a 
cytotoxic  domain  and  the  yeast  transcriptional  repressor  Cup9  serves  as  an  inactivating 
domain.  Cup9  is  a  metabolically  unstable  protein  with  predicted  nuclear  localization. 

The  cleavage-specific  toxic  effect  of  the  Cup9-DTA-based  conditional  toxin  that  we 
observed  may  have  two  underlying  contributing  mechanisms.  First,  cleavage  may  lead  to 
metabolic  stabilization  and,  as  a  result,  intracellular  accumulation  of  DTA.  Second, 
cleavage  may  interfere  with  nuclear  localization  of  DTA  leading  to  its  greater  toxicity. 
Our  system  represents  a  fast  experimental  approach  to  construct  and  test  conditional 
toxins  directed  against  HIV  PR-containing  cells. 
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INTRODUCTION 


The  human  immunodeficiency  vims  (HIV)  protease  (PR)  is  essential  at  the  later 
stages  of  the  replication  cycle  of  the  virus.  It  is  responsible  for  proteolytic  cleavage  of 
Gag  and  Gag-Pol  polyprotein  precursors  at  specific  cleavage  sites  [Dougherty,  1993 
#270].  Treatment  of  HIV  infection  with  PR  inhibitors  in  combination  with  other 
antiretroviral  drugs  can  produce  sustained  suppression  of  HIV  replication  and  reduction 
of  viral  loads.  However,  the  vims  continues  to  exist  latently  in  some  infected  cells,  and 
this  represents  a  major  hurdle  to  its  complete  eradication  [Wong,  1997  #267;  Finzi,  1997 
#268].  In  the  long  mn,  the  emergence  of  dmg-resistant  strains  of  HIV  may  create  another 
therapeutic  problem.  Therefore,  some  research  groups  focused  on  development  of  an 
alternative  anti-HIV  approach  that  would  ensure  death  of  HIV-infected  cells  thus 
eliminating  endogenous  viral  pools.  Thus,  it  was  reported  that  fusions  of  toxic  proteins 
and  human  CD4  molecule  display  selective  toxicity  toward  cells  expressing  the  HIV 
envelope  glycoprotein  and  therefore  represent  a  potential  therapeutic  tool  to  kill  HIV- 
infected  cells  [Aljaufy,  1994  #269;  Aullo,  1992  #271;  Till,  1988  #272;  Chaudhary,  1988 
#273], 

We  previously  proposed  an  idea  of  a  new  class  of  conditional  toxins  (termed 
wtoxins  for  signal-regulated,  cleavage-mediated  toxins)  that  contain  a  toxic  domain 
linked  to  a  signal  that  inactivates  it  [Varshavsky,  1996  #89].  In  the  presence  of  HIV  PR 
the  toxic  domain  would  be  released  from  the  inactivating  signal  and  becomes 
significantly  more  toxic.  We  proposed  that  a  degradation  signal  or  a  nuclear  localization 
domain  could  serve  as  an  inactivating  signal  of  a  sitoxin.  Recently  it  was  reported  that  an 
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engeneered  version  of  the  apoptosis-promoting  caspase-3  can  be  activated  by  the  HIV 
PR,  leading  to  apoptosis  of  the  HIV-infected  cells  [Vocero-Akbani,  1999  #265]. 

In  the  present  work  we  describe  the  construction  of  a  conditional  toxin  based  on 
the  fusion  between  the  yeast  transcriptional  repressor  Cup9  and  the  A  fragment  of 
diphtheria  toxin  (DTA)  linked  by  the  polypeptide  sequence  containing  an  HIV  PR 
cleavage  site.  This  conditional  toxin  dispalys  increased  toxicity  against  yeast  cells 
expressing  HIV  PR.  Our  system  may  serve  as  a  model  for  construction  and  quick  testing 
of  HIV  PR-directed  conditional  toxins. 

EXPERIMENTAL  PROCEDURES 

Plamids  and  the  yeast  strain  -  p416Gall  and  p414Met25  yeast  expression  vectors 
[Mumberg,  1994  #279]  were  a  generous  gift  of  M.  Funk  (Institute  of  Molecular  Biology, 
Marburg,  Germany).  pSE979  plasmid  containing  DTA  ORF  [Bellen,  1992  #286]  was  a 
kind  gift  of  S.J.  Elledge  (Howard  Hughes  Medical  Institute).  The  construct  NefPR 
[Arrigo,  1995  #280]  was  a  kind  gift  of  SJ.  Arrigo  (Medical  University  of  South 
Carolina). 

The  plasmid  p416Gall-Cup9-LHIV-DHFR-myc  drives  the  expression  of  Cup9- 
LHIV-DHFR-myc  fusion  protein  in  yeast.  The  Cup9-LHIV-DHFR-myc  ORF  was  cloned 
between  the  Xbal  and  HindlTL  sites  of  the  vector  p416Gall.  The  full-length  Cup9  ORF 
flanked  with  restriction  sites  Xbal  at  the  5 ’-end  and  BamHl  at  the  3 ’-end  was  generated 
by  PCR  from  yeast  genomic  DNA.  The  LHIV  linker  encoding  HIV  PR  cleavage  site 
VSFNF/PQITL  and  flanked  by  restriction  sites  BamHl  at  the  5 ’-end  and  Bglll  at  the  3’- 
end  was  generated  by  inserting  the  oligonucleotide  duplex  5’- 
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AGGATCCGT  ATCCTTT  AACTTCCCTC  AG  ATC  ACTCTTGGC  AGATCTGGTAC-3  ’ 


and 

5  ’  -C  AGATCTGCCAAGAGTGATCTGAGGGAAGTTAA  AGGATACGG  ATCCTG- 
TAC-3’.  The  full-length  mouse  DHFR  sequence  was  flanked  with  Bglll  site  at  the  5’-end 
and  Kpnl  site  at  the  3 ’-end.  The  linker  encoding  myc  epitope  with  terminator  codon  at  it’s 
end  was  flanked  by  Kpnl  site  at  it’s  5’-end  and  Hindlll  site  at  the  3’-end.  The  complete 
details  of  the  construction  procedure  are  available  upon  request. 

p416Gall-Cup9(82-306)-LHIV-DHFR-myc  was  constructed  by  replacing  the 
full-length  Cup9  ORF  in  p416Gall-Cup9-LHIV-DHFR-myc  between  Xbal  and  BamHl 
sites  with  the  truncated  Cup9(82-306)  ORF  generated  by  PCR.  p416Gall-Cup9(82-306)- 
LHIV-DTA  was  produced  by  replacing  the  DHFR-myc -encoding  fragment  between  the 
restriction  sites  Bglll  and  Xhol  in  p416Gall-Cup9(82-306)-LHIV-DHFR-myc  (Xhol  site 
is  present  downstream  from  Hindlll  site  in  p416Gall  vector)  with  DTA-encoding 
fragment.  In  p416Gall-Cup9(82-306)-LHIVmut-DTA  the  sequence  5’- 
GTATCCTTTAACTTCCCTCAGATCACTCTT-3’  encoding  HIV  PR  cleavage  site  was 
changed  to  5  ’  -GT ATCCTTT AACG7TCCTCAG  ATC  ACTCTT-3  ’ . 

Construction  of  the  plasmid  used  for  expressing  HIV  PR  in  yeast  involved  the 
following  steps.  The  NefPR  plasmid  for  HIV  PR  expression  in  mammalian  cells  was 
digested  with  Spel  and  two  copies  of  the  following  oligonucleotide  duplex  encoding  flag 
epitope  were  inserted:  5  ’  -CT  AGTGGCG  ACT  AC  A  AGG  ACG  ACG  ATGAC  A  AGT-3  ’  and 
5’-CTAGACTTGTCATCGTCGTCCTTGTAGTCGCCA-3’.  The  resulting  construct 
pNef(flag)2PR  was  used  to  asses  the  efficiency  of  self-cleavage  by  HIV  PR.  To  subclone 
the  Nef(flag)2PR  ORF  into  a  yeast  expression  vector  we  introduced  a  EcoRI  site  upsteam 
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from  the  5 ’-end  of  the  ORF,  excised  it  with  EcoRI  enzyme  (the  second  £coRI  site  is 
present  at  the  3 ’-end  of  the  ORF)  and  subcloned  it  into  the  p414Met25  plasmid  for  yeast 
expression. 

The  yeast  strain  used  in  current  work  was  JD47-13C  [Madura,  1993  #61]. 

Western  Blot,  immunoprecipitation  and  pulse-chase  assays  -  were  done 
essentially  as  previously  described  [Ghislain,  1996  #126].  Anti-myc  monoclonal 
antibodies  9E10  were  purchased  from  Berkeley  Antibody  company  (CA). 

Yeast  viability  assays  -  JD47-13C  yeast  were  transformed  simultaneously  with 
two  plasmids:  p416Gall-Cup9(82-306)-LHIV-DTA,  p416Gall-Cup9(82-306)-LHIVmut- 
DTA  or  p416Gall  as  a  first  plasmid;  and  p414Met25-Nef(flag)2PR  or  p414Met25  as  a 
second  plasmid.  The  transformation  procedure  invilved  yeast  competent  cells  stored  at  - 
80°C  [Dohmen,  1991  #287].  Primary  transformants  were  grown  on  plates  containing 
0.67%  yeast  nitrogen  base  without  amino  acids  (Difco)  supplemented  with  auxotrophic 
amino  acids  His,  Leu  and  Lys,  2%  glucose  and  1  mM  Met  for  repression  of  the  Met25 
promoter  (growth  medium)  with  2%  agar.  Colonies  were  picked  and  grown  24  hours  in  2 
ml  of  the  liquid  growth  medium.  Next  day  the  cultures  were  diluted  to  OD600~0.2  in  the 
growth  medium  and  grown  until  OD600~0.5-1.5.  Aliquotes  of  about  0.65  optic  units  (1  ml 
of  the  culture  with  OD^-OAS)  were  taken  from  each  culture,  washed  twice  with  the 
induction  medium  (0.67%  yeast  nitrogen  base  supplemented  with  His,  Leu  and  Lys  and 
2%  galactose)  and  resuspended  in  430  jxl  of  the  induction  medium.  The  cultures  were 
left  on  the  bench  for  45  min  and  spread  on  Petri  dishes  containing  the  induction  medium 
with  2%  agar  using  a  bacteriological  loop.  Yeast  growth  on  the  plates  was  monitored  for 
5  days. 
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RESULTS  AND  DISCUSSION 


The  general  idea  of  a  sitoxin  is  illustrated  on  Figure  1 .  The  crucial  property  of  a 
si  toxin  is  that  it’s  toxicity  is  greatly  enhanced  as  a  result  of  proteolytic  cleavage  by  HIV 
PR.  In  general,  a  conditional  toxin  of  this  type  is  a  protein  fusion  that  contains  (1)  a 
cytotoxic  domain;  (2)  an  inactivating  domain  (see  below) ;  and  (3)  a  domain  located 
between  the  two  aforementioned  domains  and  bearing  the  recognition  and  cleavage  sites 
of  HIV  PR.  The  presence  of  the  inactivating  domain  greatly  reduces  the  toxicity  of  the 
cytotoxic  domain,  ideally  making  it  non-toxic  as  a  part  of  the  fusion.  In  the  presence  of 
HIV  PR  the  fusion  would  be  cleaved  and  the  toxic  domain  released  from  the  fusion, 
leading  to  its  greatly  enhanced  toxicity. 

One  may  invision  different  possible  mechanisms  of  inhibiting  the  toxicity  of  the 
cytotoxic  domain  by  the  inactivating  domain.  In  the  first  possible  mechanism,  the 
inactivation  domain  may  contain  a  degradation  signal,  making  the  whole  fusion  short¬ 
lived  intracellularly,  and  therefore  relatively  non-toxic.  The  HIV  PR  would  cleave  the 
fusion  resulting  in  metabolic  stabilization  of  the  toxin  and  therefore  its  intracellular 
accumulation  and  higher  toxicity.  In  the  second  possible  mechanism,  the  inactivation 
domain  may  contain  a  translocation  signal,  such  as  NLS.  If  the  action  of  the  toxin  is 
cytosol-specific  and  the  NLS-toxin  fusion  is  rapidly  translocated  into  the  nucleus,  then 
the  toxic  domain  would  be  unable  to  exert  its  toxic  effect.  Cleavage  by  the  HIV  PR 
would  result  in  the  physical  uncoupling  of  NLS  and  the  toxic  domain.  Having  lost  the 
NLS,  the  toxic  domain  would  remain  in  the  cytosol  and  become  significantly  more  toxic. 
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Some  other  mechanisms  of  action  of  the  inactivation  domain  of  a  conditional  toxin, 
including  sterical  masking  of  the  active  site  of  a  toxin,  are  possible  as  well. 

We  decided  to  construct  a  version  of  conditional  toxin  based  on  a  fusion  between 
the  yeast  homeodomain  protein  Cup9  as  inactivation  domain  and  DTA  as  toxic  domain. 
Cup9  is  a  metabolically  unstable  protein  with  a  half-life  5-10  min,  which  is  degraded  by 
the  Ubrl -dependent  degradation  pathway  (also  referred  to  as  “the  N-end  rule  pathway”) 
in  yeast  [Byrd,  1998  #274],  Cup9  also  acts  as  a  transcriptional  repressor  of  the  PTR2 
gene,  encoding  a  peptide  transporter  [Byrd,  1998  #274],  Cup9  is  predicted  to  be  a 
nuclear  protein  functionally.  It  also  has  three  positively  charged  clasters  of  amino  acids 
at  positions  156  to  160, 217  to  220,  and  286-289;  similar  clasters  have  been  identified  to 
be  important  in  the  localization  of  nuclear  proteins  [Knight,  1994  #95].  We  reasoned  that 
a  Cup9-based  inactivation  domain  would  combine  the  properties  of  both  types  of  the 
inactivation  signals  described  above,  inhibiting  the  action  of  the  toxic  domain  by  both 
metabolic  destabilization  and  translocation  into  the  nucleus. 

DTA  is  the  enzymatically  active  fragment  of  diphtheria  toxin.  It  inhibits  cellular 
protein  synthesis  in  eukaryotes  by  inactivating  elongation  factor  2  through  ADP- 
ribosylation  [Moss,  1990  #277].  Therefore,  it  is  expected  to  be  significantly  less  toxic 
when  directed  for  localization  into  the  nucleus.  Yeast  were  shown  to  be  susceptible  to  the 
DTA  toxicity  [Chen,  1985  #276],  making  them  a  convenient  model  to  demonstrate  the 
sitoxin  principle. 

The  HIV  PR  cleavage  site  engineered  between  the  Cup9  and  DTA  domains  has 
the  sequence  VSFNF/PQITL,  which  represents  the  cleavage  site  at  the  NH2-terminus  of 
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the  HIV  PR  in  the  Gag-Pol  polyprotein  precursor,  which  is  cleaved  most  efficiently 
[Krausslich,  1989  #275]. 

Figure  2  shows  schematically  the  recombinant  genes  used  in  the  present  study. 
Initially  we  constructed  the  protein  fusion  named  Cup9-LHIV-DHFR-myc.  In  this  name 
Cup9  represents  the  full-length  Cup9  protein;  LHIV  denotes  the  19  amino  acid  (aa)  linker 
containing  the  VSFNF/PQITL  cleavage  site  of  the  HIV  PR;  DHFR  stands  for  the  full- 
length  mouse  DHFR  aa  sequence;  and  myc  is  the  epitope  for  immunodetection.  We 
intended  to  use  this  fusion  as  a  reporter  in  preliminary  experiments  by  coexpressing  it 
together  with  the  HIV  PR  and  monitoring  the  efficiency  of  its  cleavage  in  Western  blots 
and  immunoprecipitations  with  anti-myc  antibodies.  However,  the  Cup9-LHIV-DHFR- 
myc  appeared  to  be  toxic  when  overexpressed  in  yeast  apparently  because  of  the  presence 
of  the  full-length  Cup9  moiety,  since  Cup9-myc  was  also  significantly  toxic  upon 
overexpression  (Fig.  3;  and  C.  Byrd,  I.  V.  D.,  and  A.  V.,  unpublished  data).  We  decided 
to  modify  the  Cup9  sequence  to  reduce  its  toxicity.  Preliminary  evidence  indicated  that 
the  degradation  signal  of  Cup9  was  located  in  the  C-terminal  portion  of  the  protein  (C. 
Byrd,  I.  V.  D.,  and  A.  V.,  unpublished  data).  Therefore,  we  constructed  truncated 
versions  of  the  original  Cup9-LHIV-DHFR-myc  reporter  fusion  containing  deletions 
from  the  N-terminus  of  Cup9.  In  this  way  we  were  attempting  to  obtain  a  less  toxic 
reporter  protein  that  would  retain  the  Cup9  degradation  signal.  One  of  the  thus  modified 
fusions,  named  Cup9(82-306)-LHIV-DHFR-myc,  had  the  81  N-terminal  aa  of  Cup9 
deleted  (Fig.  2).  Upon  overexpression  in  yeast  under  the  control  of  Gall  promoter  from 
the  centromeric  p416Gall  vector  [Mumberg,  1994  #279],  this  truncated  fusion  was 
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significantly  less  toxic  then  the  original  Cup9-LHIV-DHFR-myc  (Fig.  3).  Therefore,  in 
the  subsequent  experiments  we  used  the  Cup9(82-306)-LHIV-DHFR-myc  reporter. 

As  a  source  of  HIV  PR  we  employed  the  chimeric  protein  NefPR  [Arrigo,  1995 
#280].  NefPR  contains  the  complete  HIV  PR  sequence  flanked  on  both  sides  with  the 
sites  for  self-cleavage  and  embedded  into  the  heterogenous  aa  sequence  of  the  HIV  Nef 
protein.  As  a  part  of  this  fusion  the  HIV  PR  was  shown  to  be  efficiently  expressed  and 
self-cleaved  in  mammalian  cells  [Arrigo,  1995  #280;  Arrigo,  1995  #278].  We  added  the 
double  flag  epitope  to  the  Nef  portion  of  the  fusion  and  confirmed  that  the  Nef  domain 
was  released  as  a  result  of  the  self-cleavage  of  the  HIV  PR  (data  not  shown).  Then  we 
placed  the  Nef(flag)2PR-encoding  ORF  under  the  control  of  the  Met25  promoter  into  the 
yeast  vector  p414Met25  [Mumberg,  1994  #279]  (Fig.  2)  and  used  the  resulting  construct 
for  the  next  experimental  steps. 

The  Cup9(82-306)-LHIV-DHFR-myc  reporter  was  expressed  alone  or  together 
with  Nef(flag)2PR  in  yeast  cells  and  its  cleavage  was  studied  in  Western  blots  with  anti- 
myc  antibodies  (Fig.4A).  In  the  absence  of  HIV  PR,  Cup9(82-306)-LHIV-DHFR-myc 
was  migrating  as  a  single  band  in  the  area  of  the  gel  consistent  with  its  deduced 
molecular  weght  of  49  kD.  In  the  presence  of  HIV  PR  this  band  decreased  in  intensity 
and  the  new  very  prominent  faster  migrating  band  corresponding  to  the  DHFR-myc 
cleavage  product  appeared  (Fig.  4A).  In  another  experiment  we  pulse-labeled  the  yeast 
cells  for  1  hour,  lysed  them  and  immunoprecipitated  the  proteins  with  the  anti-myc 
antibodies  (Fig.  4B).  This  experiment  confirmed  the  results  of  the  Western  blot.  During 
1  hour  of  labeling  in  the  presence,  but  not  in  the  absence  of  HIV  PR  the  full-length 
Cup9(82-306)-LHIV-DHFR-myc  reporter  was  cleaved  and  the  cleavage  product 
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accumulated  to  significant  amounts  (Fig.  4B).  We  concluded  that  the  reporter  Cup9(82- 
306)-LHIV-DHFR-myc  was  cleaved  efficiently  by  HIV  PR  in  our  system. 

We  were  somewhat  surprised  that  the  intensity  of  the  band  of  the  cleavage 
product  on  the  Western  blot  and  immunoprecipitation  appeared  much  stronger  than  the 
intensity  of  the  band  of  the  original  full-length  fusion.  There  are  two  possible 
explanations  for  this  observation.  First,  the  full-length  Cup9-containing  reporter  might  be 
tightly  bound  to  DNA  in  the  nucleus,  and  our  extraction  procedure  of  vortexing  with 
glass  beads  in  buffer  containing  1%  Triton  X-100  and  0.15  M  NaCl  might  not  be  suitable 
for  its  complete  recovery.  Second,  because  the  full-length  reporter  fusion  is  expected  to 
be  an  unstable  protein  due  to  the  presence  of  a  degradation  signal  inside  Cup9,  its  low 
level  may  be  due  to  its  relatively  fast  degradation;  in  contrast,  since  DHFR-myc  cleavage 
product  is  metabolically  stable,  it  may  accumulate.  To  address  the  first  possibility,  we 
employed  an  alternative  extraction  procedure  of  preparing  yeast  spheroplasts  by 
treatment  with  lyticase  and  subsequently  lysing  them  in  the  SDS-loading  buffer.  This 
very  vigorous  extraction  protocol  produced  the  same  result  on  the  Western  blot  as  our 
original  method  (data  not  shown).  Although  the  possibility  of  incomplete  extraction  of 
the  full-length  fusion  can  not  be  completely  excluded,  this  experiment  speaks  against  it. 
To  address  the  second  possibility,  we  performed  pulse-chase  analysis  (Fig.  4C).  During 
the  course  of  the  30  min  chase  in  the  presence  of  HIV  PR  the  DHFR-myc  product 
accumulated  significantly.  Without  HIV  PR  the  reporter  fusion  appeared  only  slightly 
unstable  in  this  pulse-chase  assay  (about  80%  left  after  30  min  of  chase).  Therefore,  the 
increased  intensity  of  the  band  of  the  DHFR-myc  truncated  product  compared  to  the  band 
of  the  full-length  reporter  can  not  be  entirely  due  to  their  differential  metabolic  stabilities. 
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Currently  we  do  not  have  a  full  explanation  of  this  phenomenon.  In  any  case,  we 
concluded  that  the  reporter  protein  Cup9(82-306)-LHIV-DHFR-myc  was  efficiently 
cleaved  by  HIV  PR.  Therefore,  we  proceeded  to  the  construction  of  a  conditional  toxin 
based  on  this  reporter  fusion. 

To  construct  a  conditional  toxin,  we  replaced  the  DHFR-myc  portion  of 
the  reporter  fusion  Cup9(82-306)-LHIV-DHFR-myc  with  DTA.  The  resulting  construct 
was  called  Cup9(82-306)-LHIV-DHFR-myc  (or  simply  Tox)  and  was  used  as  the 
working  conditional  toxin.  In  addition,  we  constructed  a  mutant  version  of  this 
conditional  toxin  by  replacing  the  HIV  PR  cleavage  site  VSFNF/PQITL  with 
VSFNV7PQITL.  The  F  to  V  point  mutation  at  the  cleavage  junction  was  previously 
reported  to  inhibit  the  HIV  PR  cleavage  dramatically  [Loeb,  1989  #282].  We  confirmed 
that  in  the  context  of  our  DHFR-containing  reporter  fusion  the  mutated  cleavage  site  was 
cleaved  very  inefficiently  (data  not  shown).  The  mutated  conditional  toxin  bearing  the  F 
to  V  point  mutation  at  the  HIV  PR  cleavage  site  was  named  Cup9(82-306)-LHIVmut- 
DTA  (or  simply  mTox).  Tox  and  mTox  were  expressed  in  yeast  under  the  control  of  Gall 
promoter  from  the  centromeric  vector  p416Gall  in  the  presence  or  in  the  absence  of  HIV 
PR*.  After  growing  the  cells  in  glucose  in  the  presence  of  methionine,  the  expression  of 
Tox,  mTox  and  HIV  PR  was  induced  and  viability  of  the  cells  measured  by  growing  the 
yeast  on  galactose-containing  minimal  medium  without  methionine  on  Petri  plates.  We 
observed  that  coexpression  of  Tox,  but  not  mTox  with  HIV  PR  resulted  in  significant 
increase  of  toxicity  (Fig.  5).  The  fact  that  the  F  to  V  substitution  at  the  HIV  PR  cleavage 

*  DTA-containing  plasmids  had  dramatically  lower  efficiencies  of  transfecting  yeast.  This  might 
be  due  to  a  spike  ot  transient  unregulated  expression  of  the  protein  fusions  during  or 
immediately  after  the  transfection  procedure. 
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site  interfered  with  the  toxicity  indicates  that  the  toxic  effect  we  observed  was  cleavage- 
mediated.  Thus,  the  fusion  protein  Cup9(82-306)-LHIV-DTA  works  as  a  cleavage- 
mediated  conditional  toxin. 

As  was  described  above,  the  cleavage-specific  toxic  effect  that  we 
observed  may  have  two  underlying  contributing  mechanisms.  First,  cleavage  may  lead  to 
metabolic  stabilization  and,  as  a  result,  intracellular  accumulation  of  DTA.  Second, 
cleavage  may  interfere  with  nuclear  localization  of  DTA  leading  to  its  greater  toxicity. 
Both  of  these  possibilities  are  likely  and  may,  in  fact,  contribute  together  to  the  observed 
cleavage-mediated  toxicity.  In  addition,  it  can  not  be  excluded  that  the  Cup9  portion  of 
the  toxin  interferes  with  DTA  toxicity. 

Our  system  represents  a  fast  experimental  approach  to  construct  and  test  a 
conditional  toxin  directed  against  cell  containing  HIV  PR.  Recently  Vocero-Akbani  et  al. 
reported  that  a  modified  apoptotic  protease  caspase-3  containing  an  HIV  PR  cleavage  site 
demonstrates  preferential  toxicity  against  HIV-infected  cells.  The  work  of  these  authors 
apparently  presents  another  attempt  to  construct  a  cleavage-mediated  conditional  toxin. 
Taken  together,  these  attempts  could  eventually  lead  to  creating  a  clinically  useful  toxic 
molecule  to  eradicate  HIV-infected  cells. 

A  modification  of  our  system  could  also  be  useful  as  a  genetic  test  for 
enzymatic  activities  of  different  mutant  versions  of  HIV  PR.  Thus,  HIV  PR  sequences 
from  different  HIV  isolates  could  be  PCR-amplified,  inserted  into  the  yeast  expression 
vector,  and  their  enzymatic  activities  estimated  by  co-expression  with  a  conditional  toxin 
and  monitoring  the  yeast  cell  survival.  In  yet  another  possible  application  of  this  system, 
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it  could  be  used  to  measure  efficiencies  of  recognition  and  cleavage  of  different  sites  by 
HIV  PR. 


In  sum,  we  constructed  a  cleavage-mediated  conditional  toxin  and 
demonstrated  it’s  preferential  toxicity  against  HIV  PR-expressing  yeast. 
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FIGURE  LEGENDS 


FIGURE  1 .  Cleavage-mediated  conditional  toxin  (sitoxin).  In  the  sitoxin  idea  the 
toxicity  of  a  protein  fusion  is  enhanced  as  a  result  of  proteolytic  cleavage  by  HIV  PR. 
HIV  PR  cleavage  removes  the  inactivating  domain  that  inhibits  the  activity  of  the 
cytotoxic  domain  as  a  part  of  the  fusion. 

FIGURE  2.  Constructs  used  in  the  current  study.  Arrows  over  the  Nef(flag)2PR 
construct  indicate  the  sites  of  self-cleavage  of  HIV  PR  from  the  fusion.  Sizes  of  the 
boxes  on  this  illustration  were  not  intended  to  be  exactly  proportional  to  the  sizes  of  the 
corresponding  DNA  fragments. 

FIGURE  3.  Toxicity  of  Cup9-containing  fusions  upon  overexpression  in  yeast. 

Yeast  were  transformed  with  recombinant  plasmids  encoding  the  corresponding  protein 
fusions  under  the  control  of  Gall  promoter  or  an  empty  vector  as  indicated.  Yeast 
cultures  were  processed  as  described  in  Experimental  Procedures  and  distributed  over 
Petri  plates  with  galactose-containing  medium  to  induce  protein  expression.  Picture  was 
taken  after  4  days  of  growth  on  the  Petri  plate. 

FIGURE  4.  Cleavage  by  the  HIV  PR  of  the  reporter  fusion  Cup9(82-306)-LHIV- 
DHFR-myc  upon  co-expression  in  yeast.  The  black  arrow  indicates  the  original 
Cup9(82-306)-LHIV-DHFR-myc  protein  fusion.  The  white  arrow  indicates  its  cleavage 
product  DHFR-myc.  Molecular  weights  of  the  markers  (in  kDa)  are  indicated  on  the 
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right  side  of  each  panel,  a.  Western  blot  with  anti-myc  antibodies.  The  reporter  fusion 
Cup9(82-306)-LHIV-DHFR-myc  was  expressed  either  alone  (lane  1)  or  together  with 
HIV-PR  (lane  2).  b.  Immunoprecipitations  with  anti-myc  antibodies  (Berkeley  Antibody 
Company,  CA).  Yeast  expressing  no  foreign  proteins  (lane  1),  the  reporter  fusion 
Cup9(82-306)-LHIV-DHFR-myc  alone  (lane  2)  or  together  with  HIV  PR  (lane  3)  were 
labeled  for  1  hour  with  a  mixture  of  35S-methionine  and  35S-cystein.  Protein  extracts  were 
prepared,  immunoprecipitated  with  anti-myc  antibodies  and  resolved  on  SDS-PAGE.  c. 
Pulse-chase  assays.  Yeast  cells  were  expressing  Cup9(82-306)-LHIV-DHFR-myc 
reporter  protein  and  HIV-PR  as  indicated.  Yeast  were  pulse-labeled  with  a  mixture  of 
35S-methionine  and  35S-cystein  for  165  sec  and  chased  in  the  excess  of  cold  methionine 
and  cystein  and  200  |!g/ml  cyclohexamide  for  0,  10  and  30  min  as  marked.  After  the 
chases  protein  extracts  were  prepared,  immunoprecipitated  with  anti-myc  antibodies  and 
resolved  on  SDS-PAGE. 

figure  5.  Toxicity  of  Tox,  but  not  mTox,  is  enhanced  in  the  presence  of  HIV 

PR.  Yeast  were  transformed  with  recombinant  plasmids  encoding  the  corresponding 
proteins  (+)  or  empty  vectors  (-)  as  indicated.  Yeast  cultures  were  processed  as  described 
in  Experimental  Procedures  and  distributed  over  Petri  plates  with  galactose-containing 
medium  to  induce  Tox  and  mTox  under  the  control  of  Gall  promoter  and  no  methionine 
to  induce  HIV  PR  under  the  control  of  Met25  promoter.  Pictures  were  taken  after  5  days 
of  growth  on  Petri  plates. 
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Protein  degradation  by  the  ubiquitin  (Ub)  system  controls  the  intracellular 

concentrations  of  many  regulatory  proteins.  The  degradation  signals  presented  by 

these  proteins  are  recognized  by  the  E3  components  of  the  Ub  system1'8.  Here  we 

describe  the  first  example  of  physiological  regulation  of  a  Ub-dependent  pathway 

through  allosteric  modulation  of  its  E3  activity  by  small  compounds.  Ubrlp,  the 

E3  of  the  N-end  rule  pathway  in  Saccharomyces  cerevisiae,  mediates  the 

Ub/proteasome-dependent  degradation  of  Cup9p,  a  transcriptional  repressor  of  the 

peptide  transporter  Ptr2p4  Ubrlp  also  targets  proteins  that  bear  destabilizing 

N-terminal  residues8.  Here  we  show  that  the  degradation  of  Cup9p  is 

allostencally  activated  by  dipeptides  bearing  destabilizing  N-terminal  residues. 

In  the  resulting  positive  feedback  circuit,  imported  dipeptides  bind  to  Ubrlp  and 

accelerate  the  Ubrlp-dependent  degradation  of  Cup9p,  thereby  derepressing  the 

expression  of  Ptr2p  and  increasing  the  cell's  capacity  to  import  peptides.  These 

findings  identify  the  physiological  rationale  for  the  targeting  of  Cup9p  by  Ubrlp, 

and  suggest  that  small  compounds  may  regulate  other  Ub-dependent  pathways  as 
well. 

A  protein  target  of  the  Ub  system  is  conjugated  to  Ub  through  the  action  of 
three  enzymes,  El,  E2  and  E32A6  The  resulting  ubiquitylated  protein  bears  a 
covalently  linked  multi-Ub  chain,  and  is  degraded  by  the  26S  proteasome7.  The 
selectivity  of  ubiquitylation  is  determined  mainly  by  the  E3,  which  recognizes  a 
degradation  signal  (degron)  carried  by  the  target  protein.  The  rate  of  degradation  of 
specific  proteins  is  often  regulated,  in  response  either  to  signals  from  the 
environment  or  to  signals  produced  intracellularly,  for  example,  during  cell  cycle 
progression.  In  many  cases,  this  is  achieved  by  modulating  the  exposure  or  the 
structure  of  a  degron  in  a  target  protein.  For  example,  the  degrons  of  the 
cyclm-dependent  kinase  inhibitors  Siclp  and  p27  are  activated  by  phosphorylation, 
which  is  timed  to  brmg  about  their  destruction  at  key  transition  points  in  the  cell 
cycle8.  In  other  cases,  phosphorylation  regulates  the  activity  of  an  E3  itself.  For 
example,  the  anaphase-promoting  complex  (APC),  a  multisubunit  E3,  is  activated 
only  at  mitosis9. 
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One  Ub-dependent  proteolytic  system,  termed  the  N-end  rule  pathway, 
targets  proteins  carrying  a  degradation  signal  called  the  N-degron™.  The  essential 
determinants  of  an  N-degron  are  a  destabilizing  N-terminal  residue  and  a  lysine 
residue  of  the  substrate11.  The  N-end  rule  pathway  is  present  in  all  organisms 
examined,  from  mammals  and  fungi  to  bacteria*.  In  S.  cerevisiae,  there  are  two 
classes  of  destabilizing  residues,  basic,  or  type  1  (Arg,  Lys  and  His)  and  bulky 
hydrophobic,  or  type  2  (Phe,  Leu,  Tyr,  Trp  and  lie).  Ubrlp,  a  225K,  RING-H2 
finger-containing  E3,  directly  recognizes  these  N-terminal  residues5'™.  The  lysine 
determinant  of  an  N-degron  is  the  site  of  formation  of  a  substrate-linked  multi-Ub 
chain,  which  is  produced  by  a  complex  of  the  E3  Ubrlp  and  the  E2  enzyme  Rad6p 
(Ubc2p)14  Dipeptides  bearing  a  destabilizing  N-terminal  residue  of  either  basic  or 
hydrophobic  type  act  as  competitive  inhibitors  of  the  degradation  of  N-end  rule 
substrates  carrymg  the  same  type  of  destabilizing  residue15*!?  Thus  Ubrlp  contains 
two  distinct  N-terminal  residue-binding  sites  that  are  each  capable  of  binding  either 
a  dipeptide  or  a  protein,  but  not  both  at  the  same  time. 

The  N-end  rule  pathway  was  discovered  through  the  use  of  engineered 
reporter  proteins™.  Recent  work  identified  the  first  physiological  function  of  Ubrlp 
m  S.  cerevisiae,  by  demonstrating  that  Ubrlp  is  required  for  peptide  import™.  Ubrlp 
regulates  the  uptake  of  peptides  by  controlling  degradation  of  the  35K  homeodomain 
protein  Cup9p,  a  transcriptional  repressor  of  the  di-  and  tripeptide  transporter 
Ptr2p4  In  contrast  to  the  canonical  N-end  rule  substrates,  Ubrlp  targets  Cup9p 
through  a  degron  located  in  the  C-terminal  half  of  Cup9p  (F.  Navarro-Garcia, 

G.  Turner  and  A.  Varshavsky,  unpublished  data).  Despite  this  unexpected  mode  of 
recognition,  we  asked  whether  dipeptides  bearing  destabilizing  N-terminal  residues 
could  affect  the  Ubrlp-mediated  degradation  of  Cup9p,  since  dipeptides  are  able  to 
inhibit  degradation  of  canonical  N-end  rule  substrates1?. 

To  address  this  question,  it  was  necessary  to  produce  a  variant  of  the  Cup9p 
repressor  that  could  be  moderately  overexpressed  for  pulse-chase  analyses  without 

influencing  the  expression  of  Ptr2p  and  the  uptake  of  dipeptides.  An 

Asn  (N)  Ser  (S)  substitution  at  position  265  of  Cup9p,  within  the  recognition  helix 
of  the  homeodomain19,  was  predicted  to  strongly  reduce  the  affinity  of  Cup9p  for 
DNA20.  The  N265S  substitution  did  not  alter  the  in  vivo  degradation  of  Cup9p  (data 
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not  shown).  This  Cup9p  derivative,  tagged  at  the  C-terminus  with  the  FLAG 
epitope  and  denoted  as  Cup9pNSF/  was  expressed  as  part  of  a  fusion  of  the  form 
FDHFR-Ub-Cup9pNSF/  where  fDHFR  was  the  N-terminally  FLAG-tagged  mouse 
dihydrofolate  reductase.  Ub-spedfic  proteases  (UBPs)  co-translationally  cleave  this 
UPR  (Ub /protein/reference)  fusion  at  the  Ub-Cup9p  junction,  yielding  the  test 
protein  Cup9pNSF  and  the  long-lived  FDHFR-Ub  reference  protein,  which  serves  as 
an  internal  control  for  variations  in  expression  levels  and  immunoprecipitation 

efficiency  21/22 

Cells  expressing  Cup9pNSF  were  grown  in  minimal  medium  containing 
allantom  as  the  nitrogen  source  to  avoid  the  known  effects  of  nitrogen  catabolite 
repression  on  PTR2  expression 23.  Leu-Ala  and  Arg-Ala,  dipeptides  bearing  either 
type  of  destabilizing  N-terminal  residue  (Leu,  bulky  hydrophobic;  Arg,  basic),  were 
added  to  the  medium  prior  to  pulse-chase  analysis,  to  a  final  concentration  of  10  mM 
(see  Methods).  This  dipeptide  concentration  results  in  maximal  inhibition  of 
degradation  of  N-end  rule  substrates17.  Strikingly,  the  addition  of  either  Leu-Ala  or 
Arg-Ala  exerted  an  opposite  effect  on  Cup9pNSF,  strongly  accelerating  its  degradation 
m  wildtype  (UBR1)  cells.  The  half-life  of  Cup9pNSF  decreased  from  ~5  min  in  the 
absence  of  dipeptides  (Fig.  lc)  to  less  than  1  min  in  their  presence  (Fig.  lb).  This 
stimulatory  effect  was  not  observed  in  a  ubrlA  strain,  indicating  that  the  augmented 
degradation  of  Cup9pNSF  was  dependent  on  Ubrlp.  The  enhancement  of 

degradation  required  dipeptides  bearing  destabilizing  N-terminal  residues:  Ala-Leu 
and  Ala- Arg,  which  bore  a  stabilizing  N-terminal  residue  but  had  the  same  amino 
acid  composition  as,  respectively,  Leu-Ala  and  Arg-Ala,  did  not  affect  the 

degradation  of  Cup9pNSF  (t1/2  ~5  min)  (Fig.  lb  and  data  not  shown).  Similar  results 

were  obtained  with  cells  expressing  Cup9NSF  that  was  not  a  part  of  a  UPR  fusion 
(data  not  shown). 

To  determine  the  concentration  dependence  of  the  stimulation,  we  measured 
the  degradation  of  Cup9p  at  a  range  of  concentrations  of  Trp-Ala,  another  peptide 
bearing  a  hydrophobic  (type  2)  destabilizing  N-terminal  residue.  The  enhancement 
of  Cup9pNSF  degradation  was  detectable  at  1  pM  Trp-Ala,  the  lowest  concentration 

tested  (t1/2  ~1  min)  (Fig.  lc).  In  contrast,  the  degradation  of  Cup9pNSF  was  not 
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altered  either  by  Ala-Trp  or  by  Trp  and  Ala,  the  free  amino  acid  components  of 
Trp-Ala  and  Ala-Trp  (Fig.  lc).  These  results  indicate  that  the  relevant  signaling 
molecule  in  this  process  is  a  dipeptide  bearing  a  destabilizing  N-terminal  residue. 
Experiments  with  other  dipeptides  carrying  destabilizing  N-terminal  residues 
(Leu- Ala  and  Arg-Ala)  yielded  similar  results  (data  not  shown).  Thus,  the  range  of 
dipeptide  concentrations  that  significantly  stimulated  Cup9p  degradation  was 
similar  to  physiologically  active  levels  of  many  other  nutrients.  Interestingly,  these 
concentrations  (1-10  pM)  were  1,000  to  10, 000-fold  lower  than  the  concentration 
(~10  mM)  necessary  for  maximal  inhibition  of  the  Ubrlp-dependent  degradation  of 
protein  substrates  bearing  N-degrons17.  One  possibility  is  that  inhibiting 
N-degron-mediated  proteolysis  requires  a  dipeptide  and  a  protein  substrate  to 
compete  for  the  same  binding  site  of  Ubrlp,  while  no  such  competition  exists  when  a 
dipeptide  stimulates  Cup9p  degradation,  since  the  dipeptide  and  Cup9p  bind  to  two 
different  sites  of  Ubrlp.  It  is  also  possible  that  only  a  small  fraction  of  a  cell's  Ubrlp 
molecules  have  to  be  activated  to  enhance  the  degradation  of  Cup9p,  while  the 
Ubrlp  pool  may  have  to  be  almost  saturated  by  dipeptides  to  suppress  the  targeting 
of  a  protein  bearing  an  N-degron.  Irrespective  of  the  underlying  mechanism,  the 
above  results,  together  with  the  observation  that  a  protein  bearing  a  destabilizing 
N-terminal  residue  can  be  co-expressed  with  Cup9p  without  altering  the  kinetics  of 
Cup9p  degradation  (data  not  shown),  suggest  that  cells  are  capable  of  independently 
regulating  the  Ubrlp-dependent  degradation  of  proteins  bearing  N-degrons  and 
internal  degrons. 

Cup9p  represses  transcription  of  the  transporter-encoding  PTR2  gene4.  Thus, 
the  dipeptide-induced  acceleration  of  Cup9p  degradation  would  be  expected  to 
increase  the  levels  of  PTR2  mRNA,  ultimately  leading  to  an  increase  in  dipeptide 
uptake.  In  this  circuit,  dipeptides  bearing  destabilizing  N-terminal  residues  act  as 
effectors  of  a  Ubrlp-based  positive  feedback  loop,  enhancing  their  own  uptake  by 
accelerating  the  Ubrlp-mediated  degradation  of  Cup9p.  This  conjecture  was  tested 
by  examining  the  levels  of  PTR2  mRNA  in  the  presence  or  absence  of  dipeptides  in 
the  medium.  At  25  pM,  both  Trp-Ala  and  Arg-Ala  induced  PTR2  expression  in  the 
wildtype  ( UBR1 )  strain  (Fig.  2a).  The  effect  of  Trp-Ala  was  particularly  strong, 
resulting  in  levels  of  PTR2  mRNA  comparable  to  that  of  actin  ( ACT1 )  mRNA 
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(Fig.  2a).  Both  Ubrlp  and  Cup9p  were  required  for  these  effects,  since  the  expression 
of  PTR2  was  not  altered  by  dipeptides  in  ubrlA  and  cup9A  strains.  Testing  a  range  of 
concentrations  of  Trp-Ala  showed  that  induction  of  PTR2  mRNA  could  be  observed 
at  1  pM  Trp-Ala,  increased  substantially  at  10  pM  Trp-Ala,  and  increased  more 
gradually  at  higher  concentrations,  in  agreement  with  the  observed  changes  in  the 
half-life  of  Cup9p  at  different  levels  of  Trp-Ala  (Fig.  lc). 

A  plausible  mechanism  of  the  enhancement  effect  is  that  a  dipeptide  interacts 
with  either  the  basic  or  hydrophobic  N-terminal  residue-binding  sites  of  Ubrlp, 
while  a  distinct  (third)  substrate-binding  site  of  Ubrlp  recognizes  the  internal  degron 
of  Cup9p.  In  this  model,  the  interaction  of  Ubrlp  with  dipeptides  allosterically 
increases  the  ability  of  the  Ubrlp-Rad6p  (E3-E2)  complex  to  ubiquitylate  Cup9p. 

To  test  whether  dipeptides  act  directly  through  Ubrlp,  we  examined  the  effect  of 
dipeptides  on  Cup9p  ubiquitylation  in  an  in  vitro  system  consisting  of  the  following 
purified  components:  Ubrlp  (E3),  Rad6p  (E2),  Ubalp  (El),  Ub,  ATP,  and 
radiolabeled  Cup9p.  In  this  system,  Cup9p  was  significantly  multi-ubiquitylated,  in 
a  Ubrlp/Rad6p-dependent  reaction  (data  not  shown),  in  the  absence  of  added 
dipeptides  (Fig.  3).  This  result  was  consistent  with  the  relatively  rapid  in  vivo 
degradation  of  Cup9p  (t1/2  ~5  min)  in  the  absence  of  dipeptides  (Fig.  lc). 

The  addition  of  dipeptides  bearing  either  type  of  destabilizing  N-terminal 
residue  to  the  in  vitro  system  substantially  stimulated  the  Ubrlp-dependent 
multi-ubiquitylation  of  Cup9p  (Fig.  3).  Dipeptides  of  the  same  composition  but 
bearmg  a  stabilizing  N-terminal  residue  did  not  stimulate  multi-ubiquitylation,  nor 
did  the  amino  acid  components  of  these  dipeptides  (Fig.  3).  Given  the  composition 
of  this  in  vitro  system,  our  results  demonstrate  that  dipeptides  act  directly  through 
Ubrlp,  without  an  intermediate  signaling  pathway.  The  underlying  allosteric 
mechanism  may  involve  increased  affinity  of  Ubrlp  for  Cup9p,  or  enhanced 
ubiquitylation  activity  of  the  Ubrlp-Rad6p  complex  towards  Cup9,  or  both. 

The  findings  of  this  work  show  that  the  two  binding  sites  of  Ubrlp  that 
mteract  with  destabilizing  N-terminal  residues  have  a  specific  physiological  function 
as  allosteric  effector  binding  sites.  These  sites  enable  Ubrlp  to  sense  the  presence  of 
imported  dipeptides,  and  to  induce  the  expression  of  the  Ptr2p  transporter 
accordingly,  by  accelerating  degradation  of  the  Cup9p  repressor.  The  resulting 
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understanding  of  the  regulation  of  peptide  import  is  summarized  in  Figure  4a-d. 

This  model  predicts  that  a  dipeptide  bearing  a  destabilizing  N-terminal  residue,  e.  g.. 
Leu- Ala,  should  stimulate  its  own  uptake,  in  contrast  to  Ala-Leu,  a  dipeptide  of  the 
same  composition  bearing  a  stabilizing  N-terminal  residue.  This  prediction  was 
borne  out  when  we  tested  the  ability  of  these  two  leucine-containing  dipeptides  to 

support  the  growth  of  S.  cerevisiae  auxotrophic  for  leucine.  This  Leu  strain  grew 

robustly  on  plates  supplemented  with  230  pM  Leu-Ala,  but  it  did  not  form  visible 
colonies  in  the  presence  of  230  pM  Ala-Leu  (Fig.  4e). 

Food  sources  that  S.  cerevisiae  encounters  outside  the  laboratory  setting  are 
likely  to  contain  mixtures  of  short  peptides,  a  subset  of  which  would  be  capable  of 
activating  the  Ubrlp-based  positive  feedback  circuit,  since  12  of  the  20  amino  acid 
residues  are  destabilizing  in  the  yeast  N-end  rulel2  We  modeled  this  situation  by 
providmg  cells  with  a  mixture  of  Leu-Ala  at  a  low  concentration  (2  pM)  and  Ala-Leu 
at  a  high  concentration  (230  pM).  Although  neither  dipeptide  supplement  alone 
could  satisfy  the  strain's  requirement  for  leucine,  a  mixture  of  the  two  dipeptides 
supported  robust  growth  (Fig.  4e).  Moreover,  a  mixture  of  Ala-Leu  (230  pM)  and  a 
dipeptide  lacking  leucine  but  bearing  a  destabilizing  N-terminal  residue  (10  pM 
Arg-Ala  or  1  pM  Lys-Ala)  also  rescued  growth.  In  contrast,  leucine-lacking 
dipeptides  bearing  stabilizing  N-terminal  residues,  or  the  amino  acid  components  of 

these  dipeptides,  could  not  rescue  the  growth  of  Leu-  cells  in  the  presence  of  230  pM 
Ala-Leu  (Fig.  4e).  These  results  are  predicted  by  the  model  described  in  Fig.  4a~d,  in 
that  a  dipeptide  bearing  a  destabilizing  residue  is  expected  to  accelerate  the  import  of 

all  dipeptides,  thereby  enabling  the  uptake  of  sufficient  Ala-Leu  to  satisfy  the  strain's 
leucine  requirement. 

This  work  establishes  for  the  first  time  that  the  activity  of  an  E3  can  be  directly 
linked  to  the  presence  of  an  environmental  signal  through  an  allosteric  interaction 
with  a  small  compound.  Specifically,  dipeptides  bearing  destabilizing  N-terminal 
residues  are  shown  to  act  as  allosteric  activators  of  Ubrlp,  enhancing  its  ability  to 
support  the  ubiquitylation  and  degradation  of  Cup9p.  Physiologically,  this  results  in 
a  positive  feedback  circuit  governing  the  uptake  of  peptides.  By  binding  to  Ubrlp, 
the  imported  dipeptides  accelerate  degradation  of  Cup9p,  thereby  derepressing  the 
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synthesis  of  the  Ptr2p  transporter  and  enhancing  the  cell's  ability  to  import  di-  and 
tripeptides. 

Most  eukaryotic  cells  have  the  capacity  to  import  peptides.  The  sequence 
conservation  of  both  the  basic  and  hydrophobic  N-terminal  residue-binding  sites 
between  the  mouse  and  yeast  Ubrlp  proteins24  suggests  that  the  peptide-sensing 
physiological  functions  of  these  sites  may  be  conserved  among  eukaryotes.  In 
metazoans,  many  neurotransmitters  and  hormones  bear  destabilizing  N-terminal 
residues,  so  it  is  possible  that  some  of  these  signaling  molecules  act  at  least  in  part 
intracellularly,  by  modulating  Ubrlp-mediated  protein  degradation.  The 

ClpAP-dependent  N-end  rule  pathway  of  E.  coli  25,  whose  physiological  functions 
are  unknown,  may  also  play  a  role  in  regulating  the  import  of  peptides  or  related 
compounds,  by  analogy  with  the  S.  cerevisiae  N-end  rule  pathway. 

The  discovery  of  peptide-mediated  positive  feedback  (Fig.  4)  identifies  the 
physiological  rationale  for  control  of  peptide  import  through  the  Ubrlp-dependent 
degradation  of  Cup9p:  the  N-terminal  residue-binding  sites  of  Ubrlp  enable  it  to 
sense  intracellular  dipeptide  concentrations  and  modulate  the  degradation  of  Cup9p 
accordingly.  In  addition,  our  results  expand  the  set  of  mechanisms  that  regulate 
protein  degradation  from  the  previously  known  modulatory  phosphorylation  of 
degrons  and  E3's  to  allosteric  regulation  of  an  E3  by  a  small  compound.  The  Ub 
system  is  either  known  or  suspected  to  play  major  roles  in  the  control  of 

intermediary  metabolism  and  the  transport  of  small  molecules  across  membranes2^. 
Our  findings  suggest  that  these  compounds,  or  their  enzymatically  produced 
derivatives,  may  modulate  the  functions  of  E3s  in  the  Ub  system  similarly  to  the 
effects  observed  here  with  dipeptides  and  Ubrlp. 
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Methods 

Yeast  strains  and  plasmids.  The  S.  cerevisiae  strains  used  in  pulse-chase 
experiments,  JD52  (MATa  lys2-801  ura3-52  trpl-A63  his3-A200  leu2-3,112),  JD55 
(ubrlA::H!S3),  were  described  previously26.  FDHFR-Ub-Cup9pNSF  was  expressed 

from  the  PMET25  promoter  on  the  centromeric  vector  p416MET2527.  Construction 
details  are  available  upon  request.  Strains  used  for  Northern  analyses  were  AVY30 
(MATa  leu2-3,112  ubrlA::LEU2),  AVY  31  (MATa  leu2-3, 112  cup9A::LEU2),  and  AVY32 
(MATa  LEU2),  constructed  in  the  RJD350  background  (MATa  leu2-3,112;  a  gift  from 
R.  Deshaies)  using  restriction  fragments  obtained  from  plasmids  pSOB305,  pCB1194 
and  pJJ25228,  respectively.  The  RJD350  strain  was  used  for  the  colony  formation 
assay  (Fig.  4e).  SHM  plates4  were  supplemented  with  dipeptides  or  amino  acids  at 
the  following  concentrations:  230  gM  Leu-Ala  (or  2  gM  where  noted),  230  gM 
Ala-Leu,  230  gM  each  of  Leu  and  Ala,  10  gM  Arg-Ala,  10  gM  Ala-Arg,  1  gM  Lys-Ala, 
1  gM  Ala-Lys,  10  gM  each  of  Arg  and  Ala,  or  1  gM  each  of  Lys  and  Ala. 

Pulse-chase  analysis.  Cells  were  cultured  in  SF1M4  with  auxotrophic 
supplements.  Dipeptides  were  added  to  cultures  at  OD600  ~0.6  and  incubation 
continued  for  2.5  h  in  the  experiments  in  Fig  lb,  and  for  30  min  for  Fig  lc.  Cells  were 
harvested,  washed  in  0.8  ml  of  SHM,  resuspended  in  0.4  ml  of  SHM,  and  labeled  for 
5  min  at  30°C  with  0.16  mCi  of  35S-EXPRESS  (New  England  Nuclear).  Cells  were 
pelleted,  and  resuspended  in  fresh  SHM  containing  4  mM  L-methionine  and  2  mM 
L-cysteine.  0.1  ml  samples  were  taken  at  the  time  points  indicated  and  transferred  to 
chilled  tubes,  each  containing  0.5  ml  of  0.5-mm  glass  beads,  0.7  ml  of  ice-cold  lysis 
buffer  (1%  Triton-XlOO,  0.15  M  NaCl,  5  mM  EDTA,  50  mM  Na-HEPES,  pH  7.5),  and  a 
mixture  of  protease  inhibitors  (final  concentrations  1  mM  phenylmethylsulfonyl 
fluoride,  2  gg/ml  aprotinin,  0.5  gg/ ml  leupeptin,  and  0.7  gg/ ml  pepstatin).  Extracts 
were  prepared  and  immunoprecipitations  carried  out  as  described26,  using 
anti-FLAG  M2  resin  (Sigma).  Immunoprecipitates  were  fractionated  by 
SDS-13%  PAGE,  and  detected  by  autoradiography. 

RNA  preparation  and  Northern  analysis.  Cells  were  cultured  in  SHM  to 
OD600  -0.6.  The  indicated  dipeptides  were  then  added  to  a  25  gM  final 
concentration,  and  the  incubation  was  continued  for  an  additional  30  min.  Total 
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RNA  was  prepared29,  and  25  pg  samples  were  electrophoresed  in  1%  formaldehyde- 
agarose  gels,  followed  by  blotting  for  Northern  analysis^. 

Ubrlp-dependent  in  vitro  ubiquitylation  system.  The  components  of  this 
system  were  purified  as  follows.  N-terminally  hexahistidine-tagged  Ubalp  was 
overexpressed  in  S.  cerevisiae  and  purified  by  fractionation  over  Ni-NTA,  ubiquitin 
affinity,  and  Superdex-200  columns.  Rad6p  was  overexpressed  in  E.  coli,  and 
purified  by  fractionation  over  DEAE,  Mono-Q  and  Superdex-75  columns. 
N-terminally  FLAG-tagged  Ubrlp  was  overexpressed  in  S.  cerevisiae,  and  purified  by 
fractionation  over  anti-FLAG  M2,  Ubc2p  affinity,  and  Superdex-200  columns. 
N-terminally  FLAG-tagged,  C-terminally  hexahistidine-tagged  Cup9p  was  expressed 
and  radiolabeled  in  E.  coli,  and  purified  by  fractionation  over  Ni-NTA  and  anti-FLAG 
M2  columns.  Details  of  these  expression  and  purification  protocols  are  available 
upon  request. 

The  in  vitro  ubiquitylation  reactions  contained  the  following  components: 

7  (J.M  Ub,  50  nM  Ubalp,  50  nM  Rad6p,  50  nM  Ubrlp,  550  nM  ^S-labeled  Cup9p, 

25  mM  HEPES/KOH  (pH  7.5),  25  mM  KC1, 5  mM  MgCl2, 2  mM  ATP,  0.1  mM 

dithiothreitol,  and  0.5  mg/ ml  ovalbumin  as  a  carrier  protein.  Dipeptides  or  amino 
acids,  as  indicated,  were  added  to  a  final  concentration  of  2  (xM  (top  panel.  Fig.  3)  or 
10  pM  (bottom  panel.  Fig.  3).  All  components  except  Ubalp  were  mixed  on  ice  for 
10  minutes;  Ubalp  was  then  added  and  reactions  shifted  to  30°C.  After  the 
indicated  times,  the  reactions  were  terminated  by  adding  an  equal  volume  of 
2  x  SDS-PAGE  loading  buffer  and  heating  at  95°C  for  5  min,  followed  by 
8%  SDS-PAGE. 
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Figure  Legends 

Figure  1.  Enhancement  of  Cup9p  degradation  by  dipeptides  bearing 
destabilizing  N-terminal  residues,  a.  The  fusion  protein  used  for  pulse-chase 
analysis.  The  stable  FDHFR-Ub  reference  portion  of  the  fusion  is  co-translationally 
cleaved  from  Cup9NSF  by  UBPs.  b.  Pulse-chase  analysis  of  FDHFR-Ub-Cup9NSF  in 
the  presence  of  various  dipeptides  at  10  mM.  Dipeptides  bearing  either  basic 
(Arg-Ala)  or  bulky  hydrophobic  (Leu- Ala)  destabilizing  N-terminal  residues  strongly 
enhance  Cup9NSF  degradation,  but  only  in  strains  expressing  Ubrlp.  Dipeptides 
bearing  a  stabilizing  N-terminal  residue  (Ala-Arg  and  Ala-Leu)  do  not  alter  Cup9NSF 
degradation,  c.  The  effects  of  different  concentrations  of  Trp-Ala  on  the 
enhancement  of  Cup9NSF  degradation  in  wildtype  ( UBR1 )  cells.  Lanes  marked  by  a 
dash  indicate  pulse-chase  analysis  performed  in  the  absence  of  added  dipeptides. 
Enhancement  of  Cup9NSF  degradation  was  detectable  at  1  pM  Trp-Ala,  and  became 
substantially  greater  at  10  |xM. 

Figure  2.  Effects  of  dipeptides  on  expression  of  the  dipeptide  transporter  gene 
PTR2.  a,  Induction  of  PTR2  expression  by  dipeptides  bearing  destabilizing 
N-terminal  residues  (Trp-Ala  and  Arg-Ala)  required  both  UBR1  and  CUP9. 
Dipeptides  bearing  a  stabilizing  N-terminal  residue  (Ala-Trp  and  Ala-Arg)  had  no 
effect  on  PTR2  expression.  PTR2  mRNA  and  the  ACPI  mRNA  loading  control  are 
indicated,  b.  Effect  of  different  concentrations  of  Trp-Ala  on  the  levels  of  PTR2 
mRNA. 

Figure  3.  In  vitro  ubiquitylation  of  Cup9p  is  enhanced  by  dipeptides  bearing 
destabilizing  N-terminal  residues.  Reactions  consisting  of  purified  Ubalp  (El), 
Rad6p  (E2),  Ubrlp  (E3),  Ub,  ATP  and  radiolabeled  Cup9p,  were  supplemented  with 
the  indicated  dipeptides  or  amino  acids  (top  panel,  2  |o,M,  bottom  panel,  10  |iM),  or 
left  unsupplemented  (denoted  by  a  dash),  and  allowed  to  proceed  at  30°C  for  the 
designated  times.  Radiolabeled  input  Cup9p,  and  its  multi-ubiquitylated  derivatives 
are  indicated. 
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Figure  4.  Feedback  regulation  of  peptide  import  in  S.  cerevisiae.  a.  Genetic 
diagram  of  the  peptide  transport  circuit,  b,  Ubrlp  is  required  for  dipeptide  uptake. 

In  the  absence  of  Ubrlp  ( ubrlA ),  the  transcriptional  repressor  Cup9p  is  long-lived, 
accumulates  to  high  levels,  and  extinguishes  the  expression  of  the  PTR2  gene.  Thus, 
ubrlA  cells  cannot  import  dipeptides  (red  dots),  c.  In  a  wildtype  ( UBR1 )  cell  growing 
in  the  absence  of  extracellular  dipeptides,  Ubrlp  targets  Cup9p  for  degradation 

(ti/2  ~5  min),  resulting  in  a  moderate  concentration  of  Cup9p  and  weak  but 

significant  expression  of  the  Ptr2p  transporter  (blue  double  ovals),  d.  In  wildtype 
(UBR1)  cells  growing  in  the  presence  of  extracellular  dipeptides  some  of  which  bear 
destabilizing  N-terminal  residues,  the  imported  dipeptides  bind  to  either  the  basic 
(type  1)  or  the  hydrophobic  (type  2)  residue-binding  site  of  Ubrlp.  These  peptides 
are  denoted  as  a  red  block  and  a  green  wedge,  respectively.  Binding  of  either  type  of 
dipeptide  to  Ubrlp  allosterically  increases  the  rate  of  Ubrlp-mediated  degradation  of 
Cup9p.  Peptides  of  both  types  are  shown  as  bound  to  Ubrlp,  but  in  fact  the  binding 
of  either  peptide  accelerates  Cup9p  degradation.  The  resulting  decrease  of  the 
half-life  of  Cup9p  from  ~5  min  to  less  than  1  min  results  in  a  very  low  concentration 
of  Cup9p,  and  consequently  to  a  strong  induction  of  the  Ptr2p  transporter,  e.  Colony 
formation  assays.  A  S.  cerevisiae  strain  requiring  leucine  for  growth  was  incubated  on 
plates  supplemented  with  either  dipeptides  or  their  amino  acid  constituents  at  the 
following  concentrations:  Leu- Ala,  230  pM  (or  2  pM,  as  indicated);  Ala-Leu,  230  pM; 
Arg-Ala,  10  pM;  Ala-Arg,  10  pM;  Lys-Ala,  1  pM;  Ala-Lys,  1  pM;  Arg  and  Ala, 

10  pM  each,  Lys  and  Ala,  1  pM  each. 
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ABSTRACT 

The  N-end  rule  relates  the  in  vivo  half-life  of  a  protein  to  the  identity  of  its 
N-terminal  residue.  N-terminal  asparagine  and  glutamine  are  tertiary  destabilizing 
residues,  in  that  they  are  enzymatically  deamidated  to  yield  secondary  destabilizing 
residues  aspartate  and  glutamate,  which  are  conjugated  to  arginine,  a  primary 
destabilizing  residue.  N-terminal  arginine  of  a  substrate  protein  is  bound  by  the 
Ubrl -encoded  E3a,  the  E3  component  of  the  ubiquitin/proteasome-dependent  N-end 
rule  pathway.  We  describe  the  construction  and  analysis  of  mouse  strains  lacking  the 
asparagine-specific  N-terminal  amidase  (NtN-amidase),  encoded  by  the  Ntanl  gene. 

In  wildtype  embryos,  Ntanl  was  strongly  expressed  in  the  branchial  arches,  and  in  the 

tail  and  limb  buds.  The  Ntanl'/'  mouse  strains  lacked  the  NtN-amidase  activity  but 
retained  glutamine-specific  N tQ- amidase,  indicating  that  the  two  enzymes  are 
encoded  by  different  genes.  Among  the  normally  short-lived  N-end  rule  substrates, 
only  those  bearing  N-terminal  asparagine  became  long-lived  in  Ntanl'/-  fibroblasts. 

The  Ntanl'/'  mice  were  fertile  and  outwardly  normal,  but  differed  from  their 
congenic  wildtype  counterparts  in  spontaneous  activity,  spatial  memory,  and 
a  socially  conditioned  exploratory  phenotype  that  has  not  been  previously  described 
with  other  mouse  strains. 
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INTRODUCTION 

A  multitude  of  regulatory  circuits  involve  conditionally  or  constitutively 
short-lived  proteins  (26,  27 ,  44,  48,  49,  64).  Features  of  proteins  that  confer  metabolic 
instability  are  called  degradation  signals,  or  degrons  (37,  63).  The  essential  component 
of  one  degradation  signal,  termed  the  N-degron,  is  a  destabilizing  N-terminal  residue 
of  a  protein  (3).  A  set  of  N-degrons  containing  different  N-terminal  residues  which 
are  destabilizing  in  a  given  cell  yields  a  rule,  termed  the  N-end  rule,  which  relates  the 
in  vivo  half-life  of  a  protein  to  the  identity  of  its  N-terminal  residue.  An  N-end  rule 
pathway  is  present  in  all  organisms  examined,  from  mammals  and  plants  to  fungi 
and  prokaryotes  (63). 

In  eukaryotes,  an  N-degron  comprises  two  determinants:  a  destabilizing 
N-terminal  residue  and  an  internal  lysine  of  a  substrate  protein  (4,  32,  60).  The  Lys 
residue  is  the  site  of  formation  of  a  substrate-linked  multiubiquitin  chain  (15,  49). 

The  N-end  rule  pathway  is  thus  one  pathway  of  the  ubiquitin  (Ub)  system  (25-27). 

Ub  is  a  76-residue  eukaryotic  protein  that  exists  in  cells  either  free  or  covalently 
conjugated  to  many  other  proteins.  The  Ub  system  plays  a  role  in  a  vast  range  of 
processes,  including  cell  growth,  division,  differentiation,  and  responses  to  stress. 

In  most  of  these  processes,  Ub  acts  through  routes  that  involve  the  degradation  of 
Ub-protein  conjugates  by  the  26S  proteasome,  an  ATP-dependent  multisubunit 
protease  (10, 17, 20, 51). 

The  N-end  rule  has  a  hierarchic  structure.  In  the  yeast  S.  cerevisiae,  Asn  and 
Gin  are  tertiary  destabilizing  N-terminal  residues  in  that  they  function  through  their 

conversion,  by  the  NTAl-e ncoded^  N-terminal  amidohydrolase  (Nt-amidase),  into 
the  secondary  destabilizing  N-terminal  residues  Asp  and  Glu  (6).  Destabilizing 
activity  of  N-terminal  Asp  and  Glu  requires  their  conjugation,  by  the  S.  cerevisiae 
ATEl-e ncoded  Arg-tRNA-protein  transferase  (R-transferase)  (8,  41),  to  Arg,  one  of  the 
primary  destabilizing  residues  (Fig.  1A).  In  mammals,  the  deamidation  step  is 
mediated  by  two  Nt-amidases,  NtN-amidase  and  NtQ-amidase,  which  are  specific, 
respectively,  for  N-terminal  Asn  and  Gin  (Fig.  1A)  (24,  59).  The  mammalian 
counterpart  of  the  yeast  R-transferase  Atelp  exists  as  two  distinct  species,  ATEl-lp  and 
ATElp-2,  which  are  produced  through  alternative  splicing  of  Atel  pre-mRNA  (34). 
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In  vertebrates,  the  set  of  secondary  destabilizing  residues  contains  not  only  Asp  and 
Glu  but  also  Cys,  which  is  a  stabilizing  residue  in  yeast  (Fig.  1A)  (18,  23).  The  primary 
destabilizing  N-terminal  residues  are  bound  directly  by  the  UBRl-encoded 
N-recognin,  the  targeting  (E3)  component  of  the  N-end  rule  pathway.  In  S.  cerevisiae, 
Ubrlp  is  a  225  kDa  protein  which  recognizes  potential  N-end  rule  substrates  through 
its  type  1  and  type  2  substrate-binding  sites.  The  type  1  site  binds  the  basic  N-terminal 
residues  Arg,  Lys,  and  His.  The  type  2  site  binds  the  bulky  hydrophobic  N-terminal 
terminal  residues  Phe,  Leu,  Trp,  Tyr,  and  lie  (35,  63).  Ubrlp  contains  yet  another 
substrate-binding  site  that  targets  proteins  such  as  Cup9p  and  Gpalp,  which  bear 
internal  (non-N- terminal)  degrons  (12,  54).  The  Ubrl  genes  encoding  mouse  and 
human  N-recognins,  also  called  E3a,  have  been  cloned  (36),  and  mouse  strains  lacking 
Ubrl  have  recently  been  constructed  (Y.  T.  Kwon  and  A.  Varshavsky,  unpublished 
data). 

The  known  functions  of  the  N-end  rule  pathway  include  the  control  of  peptide 
import  in  S.  cerevisiae,  through  the  degradation  of  Cup9p,  a  transcriptional  repressor 
of  PTR2  which  encodes  the  peptide  transporter  (1,  12);  a  mechanistically  undefined 
role  in  regulating  the  Slnlp-dependent  phosphorylation  cascade  that  mediates 
osmoregulation  in  S.  cerevisiae  (47);  the  degradation  of  alphaviral  RNA  polymerases 
and  other  viral  proteins  in  infected  metazoan  cells  (19,  38);  and  the  degradation  of 
Gpalp,  a  Ga  protein  of  S.  cerevisiae  (43,  54).  Physiological  N-end  rule  substrates  were 
also  identified  among  the  proteins  secreted  into  the  mammalian  cell's  cytosol  by 
intracellular  parasites  such  as  the  bacterium  Listeria  monocytogenes.  Short  half-lives 
of  these  bacterial  proteins  are  required  for  the  efficient  presentation  of  their  peptides 
to  the  immune  system  (56).  Inhibition  of  the  N-end  rule  pathway  was  reported  to 
interfere  with  mammalian  cell  differentiation  (28),  and  to  delay  limb  regeneration  in 
amphibians  (61).  Studies  of  the  Ub-dependent  proteolysis  of  endogenous  proteins  in 
muscle  extracts  suggested  that  the  N-end  rule  pathway  plays  a  role  in  catabolic  states 
that  result  in  muscle  atrophy  (39,  57).  A  crush  injury  to  the  rat  sciatic  nerve  was 
reported  to  result  in  a  ~  10-fold  increase  in  the  rate  of  arginine  conjugation  to  the 
N-termini  of  proteins  in  the  nerve's  region  upstream  of  the  crush  site,  suggesting  an 
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injury-induced  increase  in  the  concentration  of  R-transferase  substrates  and/or  an 
enhanced  activity  of  the  N-end  rule  pathway  (65). 

Physiological  substrates  of  either  yeast  or  metazoan  Nt-amidases  and 
R-transferases  are  unknown.  Engineered  N-end  rule  substrates,  including  substrates 
of  Nt-amidases  and  R-transferases,  can  be  produced  in  vivo  through  the  Ub  fusion 
technique,  in  which  a  Ub-X-reporter  fusion  is  cleaved  by  deubiquitylating  enzymes 
(DUBs)  (66)  after  the  last  residue  of  Ub,  yielding  a  reporter  bearing  the  desired 
N-terminal  residue  X  (3,  63). 

The  mouse  Asn-specific  NtN-amidase  is  encoded  by  the  17  kb  Ntanl  gene. 

The  1.6  kb  Ntanl  mRNA  specifies  the  310-residue  NtN-amidase  (24).  In  the  present 
work,  we  characterized  the  expression  and  intracellular  localization  of  NtN-amidase. 
We  also  constructed  mouse  strains  bearing  a  homozygous  deletion/disruption  of 
Ntanl,  and  showed  that  these  mice  lacked  both  NtN-amidase  and  the  Asn-specific 

branch  of  the  N-end  rule  pathway.  The  Ntanl'/'  mice  were  fertile,  outwardly 
normal,  but  were  found  to  differ  from  their  congenic  wildtype  counterparts  in  their 
spontaneous  activity  and  spatial  memory.  Among  these  differences  was  a  socially 

conditioned  exploratory  phenotype  of  Ntanl'/- m ice  that  has  not  been  previously 
described,  to  our  knowledge,  with  other  mouse  strains. 

MATERIALS  AND  METHODS 

Construction  of  mouse  strains  lacking  NtN-amidase.  Genomic  Ntanl  DNA 
fragments  were  subcloned  from  the  PI  phage  DNA  containing  the  strain  C129-derived 
mouse  Ntanl  gene  (24).  The  1,462  bp  HmdIII-Psfl  fragment  containing  exon  6  and  its 
flanking  introns  was  used  as  the  vector's  short  homology  arm.  The  7.2  kb  Xhol-Notl 
fragment  containing  exon  1  and  its  flanking  intron  was  used  as  the  long  homology 
arm.  To  construct  the  targeting  vector,  a  1,462  bp  HmdIII-Psfl  fragment  containing 
exon  6  and  its  flanking  introns  (the  vector's  short  arm)  and  the  PGK/TK  cassette  of 
PPNT  (a  gift  from  R.  C.  Mulligan,  Harvard  Medical  School,  Boston,  MA)  were 
inserted  into  pPGKRN  containing  the  wildtype  PGK/neo  cassette  (a  gift  from 
R.  Jaenisch,  Whitehead  Institute,  Cambridge,  MA),  yielding  pPGK-SA.  The  vector's 
7.2  kb  long  arm  was  produced  by  joining,  within  pPGK-SA,  a  4.8  kb  Xhol-EcoRl 
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fragment  to  its  flanking  2.6  kb  EcoRI-EcoRI  fragment  containing  exon  1  and  flanking 
intron  pPGK-SA,  yielding  the  Ntanl  targeting  vector  pNTANl-KO  (Fig.  IB).  The 
X/zoI-linearized  targeting  vector  (Fig.  IB)  was  electroporated  into  CJ7  ES  cells. 

Selection  with  G418  (at  0.4  mg/ml)  and  l-(2,-deoxy/  2'  fhioro-(3-d-arabinofuranosyl)- 
5-iodouracil  (FIAU)  (at  0.4  gM)  was  started  24  h  after  electroporation.  Correctly 
targeted  ES  cells  were  identified  by  PCR  and  Southern  hybridization  with  the  5'  and  3' 
probes  (Fig.  1C,  D).  Cells  of  the  12  independent  ES  cell  clones  were  injected  into  3.5 
dpc  (days  postcoitum)  C57BL/6J  blastocysts.  The  resulting  male  chimeras  were  bred 
with  C57BL/6J  females  to  test  for  germline  transmission  of  the  mutated  Ntanl  gene. 

The  Ntanl* mice  resulting  from  this  cross  (6  out  of  12  independent  ES  clones  were 

found  to  populate  germline  in  these  tests)  were  intercrossed  to  produce  Ntanl~/~ 
mice.  Alternatively,  the  initial  male  chimeras  were  mated  with  129/SvEv  females, 

yielding,  through  the  analogous  series  of  steps,  Ntanl~/~  mice  in  the  strain-129 

background.  All  of  the  behavioral  tests  were  carried  out  with  Ntanl~/~  mice  in  the 
strain-129  background.  For  genotyping,  the  tail-derived  DNA  was  analyzed  by  PCR,  or 
digested  with  either  BamHI  or  HindUL/Xhol,  and  analyzed  by  Southern  hybridization. 
A  0.75  kb  Psfl  fragment  and  a  0.9  kb  Pstl  fragment  (indicated  in  Fig.  IB)  were  used  as 
the  5'  and  3'  hybridization  probes,  respectively. 

Northern  and  RT-PCR  analyses  of  RNA.  Total  RNA  was  isolated  from  brain, 

testis  and  EF  cells  of  wildtype  and  Ntanl~/~  mice  as  described  (34).  RNA  was 
fractionated  by  electrophoresis  in  1%  formaldehyde-agarose  gels,  blotted  onto  Hybond 
N+  (Amersham),  and  hybridized  with  32P-labeled  probes  specific  for  different  regions 
of  the  Ntanl  cDNA  (GenBank  #U57692).  Probe  a,  nt  34-900;  probe  b,  nt  118-450;  probe 
c,  nt  118-900;  probe  d,  nt  34-450;  probe  e,  nt  470-670;  probe  f,  nt  680-900.  Alternatively, 
RT-PCR  was  carried  out  (2),  with  first-strand  cDNA  synthesized  using  Superscript  II 
polymerase  (GIBCO,  Frederick,  MD).  PCR  was  done  using  primers  specific  for 
different  regions  of  the  Ntanl  cDNA.  Primer  PI,  S'-ATGCCACTGCTGGTGGATGG- 
GCAG  (forward);  P2, 5'-GAGCCAGACTTCTCAGAGGTCAG  (forward);  P3, 5'-GACA- 
TTCACTTAGTGACATTATG  (forward);  P4,  5'-TTCTGTGACAGCTGCCTGTCATC 
(reverse);  P5, 5'-CATCAAGGTAGATCTAATATGTTC  (reverse).  The  ratio  of  mouse 
Atel-1  and  Atel-2  mRNAs  was  determined  as  described  (34). 
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Whole-mount  in  situ  hybridization.  Wildtype  and  Ntanl '/"  embryos  were 
staged,  fixed,  and  processed  for  in  situ  hybridization  as  described  (16).  A  0.3  kb 
fragment  of  the  Ntanl  cDNA  (nt  108-448)  that  was  encompassed  by  the  deleted  region 

in  the  Ntanl'/-  allele  was  subcloned  into  Xbal-Xhol  sites  of  Bluescript  II  SK+ 
(Stratagene),  and  the  resulting  plasmid,  pMR27,  was  used  as  a  template  for 
synthesizing  antisense  RNA  probe  labeled  with  digoxigenin  (Roche  Molecular 
Biochemicals,  Indianapolis,  IN). 

Localization  of  NTANlp-GFP.  The  mouse  Ntanl  ORF  was  subcloned  into  the 
Xho I  and  ASel  sites  of  pEGFP-Nl  (Clontech),  yielding  pNTANl-GFP,  which  expressed 
the  NTANlp-GFP  fusion  protein  from  the  PCMV  promoter.  NIH  3T3  cells  (ATCC 
1658-CRL)  were  grown  as  monolayers  in  Dulbecco's  Modified  Eagle  Medium 
(Gibco-BRL)  supplemented  with  10%  fetal  bovine  serum.  Cells  were  grown  to  ~15% 
confluence  on  glass  coverslips  for  24  hr  prior  to  transfection  with  either  the 
GFP-expressing  control  vector  pEGFP-Nl  or  pNTANl-GFP,  using  Lipofectamine 
(GIBCO)  and  the  manufacturer-supplied  protocol.  Cells  were  incubated  for  5  hr  at 
37°C  in  the  serum-free  media  containing  DNA  and  Lipofectamine.  An  equal  volume 
of  media  containing  20%  serum  was  then  added,  and  the  cells  were  grown  for  another 
12  to  20  hr  at  37°C.  Cells  were  fixed  with  2%  formaldehyde  in  phosphate-buffered 
saline,  and  GFP  fluorescence  was  examined  using  Zeiss  Axiophot  microscope. 

Immortalization  of  embryonic  fibroblasts,  transfection,  and  pulse-chase 
analysis.  Embryonic  fibroblast  (EF)  cells  were  isolated  from  either  Ntanl'/'  or  +/+ 

13.5  days  old  (el3.5)  embryos  as  described  (52).  EFs  were  grown  in  DMEM/F12 
medium  (GIBCO)  supplemented  with  15%  fetal  bovine  serum,  antibiotics,  and  2  mM 
L-glutamine.  They  were  immortalized  using  the  SV40  virus  (14).  The 
SV40-transformed  cell  lines  were  transiently  transfected,  using  Lipofectamine  PLUS 
(GIBCO),  with  the  plasmids  pRC/ dhaUbXnsP4Pgal,  which  expressed 
DHFR-ha-Ub R48-X-nsP4 Pgal  test  proteins  (X=Met,  Asn,  Gin,  or  Arg)  (40)  (see  the  main 
text).  Cells  were  labeled  with  35S-Translabel  (New  England  Nuclear,  Boston,  MA), 
followed  by  a  chase  for  0,  1,  and  2  hr  in  the  presence  of  cycloheximide,  preparation  of 
extracts,  immunoprecipitation,  SDS-PAGE,  autoradiography,  and  quantitation  with 
Phosphorlmager,  essentially  as  described  (40). 
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X-DHFR  test  proteins.  35S-labeled  X-DHFR  proteins  (X  =  Asn,  Gin,  or  Asp)  were 
prepared  as  described  (24).  Briefly,  the  plasmids  pSG4,  pSG41,  and  pSG44  expressing, 
respectively,  Ub-Asn-DHFR,  Ub-Asp-DHFR  and  Ub-Gln-DHFR  from  the  Ptrc 
promoter  (24)  were  transformed  into  E.  coli  JM101  carrying  pJT184,  which  expressed 
Ubplp,  a  deubiquitylating  enzyme  (DUB)  of  S.  cerevisiae  (62).  Cells  of  a  50-ml  culture 
grown  at  37°C  to  an  Agoo  of  ~0.9  in  LB  plus  ampicilin  (40  gg/ml)  and  chloramphenicol 
(20  gg/ml),  were  pelleted  and  resuspended  in  50  ml  of  M9  supplemented  with  glucose 
(0.2%),  thiamine  (2  gg/ml),  ampicilin  (40  gg/ml),  1  mM  isopropylthio-p-D-galactoside 
(IPTG),  and  Methionine  Assay  Medium  (Difco).  The  suspension  was  shaken  for  1  hr 
at  37°C,  followed  by  the  addition  of  1  mCi  of  35S-Translabel  (ICN)  and  further 
incubation  for  1  hr  at  37°C.’  Unlabeled  L-methionine  was  then  added  to  1  mM,  and 
the  shaking  was  continued  for  another  10  min.  The  cells  were  harvested  and  the 
lysate  were  prepared  by  the  addition  of  lysozyme  and  Triton  X-100,  followed  by 
centrifugation  (24).  The  supernatant,  containing  [35S]X-DHFR  (X=Asn,  Asp,  or  Glu) 
was  purified  by  affinity  chromatography  on  a  methotrexate  column  (Pierce)  (0.5  ml 
bed  volume).  [35S]X-DHFRs  were  examined  by  SDS-PAGE;  they  were  greater  than 
95%  pure. 

Isoelectrofocusing  assay  for  amidase  activity.  Extracts  from  wildtype  or 

Ntanl-/-  EF  cells  were  prepared  by  homogenization  in  0.01%  Triton  X-100, 10  % 
glycerol,  125  mM  KC1,  7.5  mM  MgCl2, 1  mM  DTT,  0.25  mM  EDTA,  0.2  mM 
pheny lmethy lsulf onyl  fluoride  (PMSF),  130  mM  Tris-HCl  (pH  7.5),  and  the  protease 
inhibitors  antipain,  chymostatin,  leupeptin,  pepstatin  and  aprotinin  (Sigma),  each  at 
25  gg/ ml.  For  the  deamidation  assay,  5  fxl  of  35S-labeled  X-DHFR  (0.5  mg/ml  in  storage 
buffer;  X=Asn,  Asp,  or  Glu)  was  mixed  with  20  jul  of  the  extract,  incubated  for  2  hr  at 
37°C,  and  placed  on  ice.  Samples  (5  gl)  were  applied  onto  isoelectrofocusing 
polyacrylamide  (IEF-PAGE)  plates  (pH  3 .5-9.5)  (Pharmacia,  Piscataway,  NJ,  USA) 
precooled  to  10°C.  IEF  was  carried  out  for  80  min  at  30  W  in  a  cooled  IEF  apparatus 
(Hoefer,  San  Francisco,  CA).  The  plates  were  soaked  in  100  ml  of  10%  CCl3COOH,  5% 
5-sulfosalicylic  acid,  stained  with  Coomassie  Blue  to  detect  IEF  markers  (Pharmacia), 
and  autoradiographed.  35S  in  the  bands  of  more  acidic  (deamidated)  and  more  basic 
(initial)  X-DHFR  species  was  quantitated  using  Phosphorlmager  (Molecular 
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Dynamics,  Sunnyvale,  CA). 

Behavioral  tests.  Mice  were  kept  on  a  0600-  to  1800-hr  light  cycle  and  tested 
during  the  light  period.  For  the  rotarod,  weight  retention,  coat-hanger  and 

platform-leaving  tests,  the  strain-129  Ntanl~/~  mice  and  their  congenic  +/+ 

littermates  (produced  through  matings  of  Ntanl~/+  mice)  were  used.  Non-littermates 
were  also  used  for  the  platform-leaving  test  (see  below).  For  the  Shuttlebox  and 
passive  avoidance  tests,  the  elevated  plus-maze  test,  the  open  field  test,  the  Morris 
maze,  the  radial-arm  maze,  and  the  Lashley  maze,  the  strain-129  +/+  and  congenic 

Ntanl~/~  mice  (produced,  respectively,  through  +/+  X  +/+  and  Ntanl~/~  XNtanl~/~ 
matings)  were  used.  The  same  series  of  tests  were  repeated  six  weeks  or  more  after 
the  initial  experiment,  using  both  male  and  female  mice.  To  determine  whether  the 
Ntanl  genotype  affected  long-term  memory,  mice  were  retested  in  the  Morris  maze, 
Lashley  maze  and  the  Shuttlebox  avoidance  test  at  least  7  weeks  after  their  original 
learning.  The  data  were  evaluated  using  the  analysis  of  variance  (ANOVA). 
Significance  was  set  at  the  .05  level;  all  behavioral  differences  described  in  Results 
were  at  this  or  higher  levels  of  statistical  significance. 

Rotarod  test.  The  apparatus  (UGO,  Basile,  Italy)  consisted  of  a  motor-driven 
rod  3  cm  in  diameter  that  carried  5  compartments,  separated  by  walls  25  cm  in  height. 
One  pair  of  littermates  was  placed  on  the  rotating  rod  in  each  compartment.  Mice 
were  placed  on  the  rod  at  either  10  or  20  rpm;  the  cutoff  time  was  2  min.  Each  mouse 
was  tested  on  the  rod  6  times,  at  1  hr  intervals. 

Weight  retention.  Mice  were  suspended  by  the  tails  and  made  to  grab  a  wire 
loop  (wire  diameter  2  mm;  weight  40  g)  70  cm  above  the  floor.  Time  elapsed  before 
the  animal  released  the  loop  was  measured.  Each  mouse  was  tested  5  times  a  day,  at 
1  hr  intervals,  for  the  total  of  3  days. 

Coat-hanger  test.  A  wire  coat-hanger  (wire  diameter,  2  mm;  length  of  side  bars, 
29  cm;  length  of  the  horizontal  bar,  45  cm)  suspended  75  cm  above  the  table  was  used 
to  examine  motor  coordination  (11).  A  trial  began  by  allowing  the  mouse  to  grab  the 
middle  of  the  horizontal  bar  with  two  front  paws.  The  time  elapsed  before  the  animal 
grabbed  the  horizontal  bar  with  all  four  paws,  and  the  time  before  the  two  front  paws 
reached  one  of  the  side  bars  were  determined.  Each  mouse  in  the  trial  was  tested 
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twice  a  day,  at  6  hr  intervals,  for  9  days. 

Shuttlebox  avoidance,  passive  avoidance,  and  elevated  plus-maze  tests.  For  the 
shuttlebox  and  passive  avoidance  tests,  the  Gemini  Avoidance  system  (San  Diego 
Instruments,  San  Diego,  CA)  was  used.  The  procedures  are  described  in  Results. 

Open  field  test.  For  the  open-field  activity  measurements,  the  Omnitech 
Digiscan  Animal  Activity  Monitor  was  used.  Each  mouse  was  placed  into  a  square 
chamber  (20  x  20  cm)  for  one  9-min  session,  and  the  movements  were  tabulated  along 
the  X  and  Y  axes,  using  infrared  sensors. 

Morris  maze.  A  mouse  was  placed  into  a  circular  black  water  tub  (d=123  cm; 
temperature  19-21°C)  from  one  of  the  four  locations  (N,  S,  E,  W),  and  had  to  find  a 
platform  (d=23  cm)  submerged  1  cm  below  the  surface  of  water  and  set  23  cm  from  the 
wall.  There  were  no  cues  within  the  maze,  so  the  animal  had  to  use  extra-maze 
spatial  cues  in  guiding  itself  to  the  platform  in  repeated  trials  (46).  An  animal  was 
given  4  trials  per  day  (maximum  of  45  sec  per  trial),  one  from  each  of  the  four 
locations  ,  their  order  determined  quasi-randomly.  Each  mouse  was  tested  for  5  days, 
with  the  platform  in  a  fixed  position.  On  day  6,  the  platform  was  moved  to  the 
quadrant  diagonally  opposite  its  previous  location,  and  each  mouse  was  given  4  trials, 
as  before.  This  regimen  is  referred  to  as  "reversal  learning".  The  mice  were  re-tested 
(with  the  platform  in  the  original  position)  7  or  8  weeks  later.  The  animal's  path  was 
traced  on  an  electronic  digitizing  tablet  containing  a  template  of  the  maze  (22).  The 
following  parameters  were  recorded:  total  time;  distance  traveled;  average  speed; 
and  percent  of  time  spent  in  each  quadrant  of  the  maze. 

Spatial  radial-arm  maze.  The  apparatus  and  testing  procedure  were  described 
previously  (30).  Hidden  escape  platforms  were  placed  at  the  ends  of  4  of  the  8 
identical  arms  which  radiated  from  the  central  area.  During  training  the  animal  was 
released  from  a  start  arm  and  had  to  find  one  of  the  escape  platforms.  Once  a 
platform  was  found,  the  arm  containing  that  platform  was  blocked.  The  training 
session  consisted  of  4  trials  in  which  a  mouse  had  to  locate  all  4  platforms.  This  was 
followed  by  11  testing  sessions  which  differed  from  the  training  session  only  by 
removing  the  platform  while  the  mouse  was  in  its  home  cage  between  trials.  Thus, 
on  successive  trials,  the  animal  had  to  remember  which  alleys  it  had  previously 
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entered  and  not  enter  those  alleys  again.  The  score  was  total  errors  obtained  by 
summing  together  Working  Memory  Correct  errors,  Working  Memory  Incorrect 
errors,  and  Reference  Memory  errors  (30). 

Non-spatial  radial-arm  maze  learning.  The  non-spatial  swimming  radial-arm 
maze  apparatus  and  testing  procedure  were  similar  to  the  above  spatial  version, 
except  for  the  following.  Each  of  the  arms  was  painted  in  a  different  black/white 
pattern:  solid  black,  solid  white,  vertical  stripes,  horizontal  stripes,  black  with  white 
polka  dots,  white  with  black  polka  dots,  zigzags,  and  the  galvanized  steel  gray  color 
(the  start  arm).  The  patterned  arms  which  contained  platforms  were  different  for 
each  mouse,  but  remained  constant  throughout  the  testing  of  a  given  mouse.  For  the 
training  session  (session  1),  the  entrance  to  the  arm  with  the  just  chosen  platform  was 
blocked,  and  in  addition,  the  maze  was  quasi-randomly  rotated,  so  that  each  patterned 
arm  now  pointed  toward  a  different  place  in  space.  As  in  the  spatial  version,  the  test 
sessions  (sessions  2-12)  were  the  same  as  the  training  session,  except  that  while  the 
mouse  was  in  its  home  cage,  after  finding  a  platform,  this  platform  was  removed 
from  the  maze,  and  the  arm's  entrance  remained  open  for  the  remainder  of  the 
session.  Over  8  trials,  or  2  sessions,  each  patterned  arm  pointed  toward  each  of  the 
8  spatial  locations  in  the  testing  room.  The  errors  were  scored  as  above. 

Lashley  maze  learning.  This  maze  contained  cul-de-sacs  that  an  animal  had  to 
learn  to  avoid,  and  T-choices  that  required  the  animal  to  learn  making  correct  left  or 
right  turns.  A  water  version  of  the  maze  was  used,  with  the  temperature  at  19-20°C 
(21).  All  mice  were  given  one  trial  per  day  for  5  days.  An  animal  can  learn  this  maze 
using  two  mutually  non-exclusive  strategies:  by  memorizing  extra-maze  spatial  cues, 
or  by  memorizing  the  sequence  of  correct  left  and  right  turns.  The  measures  of 
learning  included  the  learning  index,  defined  as  the  number  of  correct  entries  divided 
by  total  number  of  entries,  the  number  of  cul  entries  (entries  into  dead  ends)  when 
swimming  forward  (i.  e.,  toward  the  goal),  number  of  forward  T-choice  errors,  and  the 
total  number  of  backward  errors.  Retention  testing  was  done  7  weeks  later  for 
Experiment  1,  and  8  weeks  later  for  Experiment  2. 

Platform-leaving  test.  To  compare  the  exploratory  activities  of  two  previously 
untested  mice  on  the  same  platform,  we  devised  the  platform-leaving  test.  One 
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Ntanl  /  mouse  of  the  strain  129  and  one  congenic  +/+  mouse  of  the  same  gender 
(either  a  littermate  or  a  non-littermate)  were  placed,  at  the  same  time,  on  the  platform 
(16  cm  x  22  cm)  2.7  cm  above  the  surface  of  a  laboratory  bench.  For  each  test,  the 
platform  was  covered  with  fresh  white  paper.  Mice  were  allowed  to  either  explore  or 
step  down  from  the  platform  up  until  the  cutoff  time  of  3  min.  (Step-down  was 
recorded  when  all  four  paws  of  a  mouse  were  no  longer  in  contact  with  platform.) 

The  number  and  genotypes  of  mice  leaving  the  platform  first,  and  the  number  of 
mice  not  leaving  by  3  min  were  recorded.  A  total  of  3  trials,  at  1-hr  intervals,  were 
carried  out  for  every  pair  of  mice  tested  on  a  given  day.  Each  set  of  tests  employed 
~10  pairs  of  mice  2  to  4  months  old.  Four  independent  sets  of  tests,  with  previously 
untested  mice,  were  carried  out  over  the  period  of  6  months. 

RESULTS 

Construction  of  Ntanl~^~  mice.  A  deletion/disruption  allele  of  the  ~17  kb 
mouse  Ntanl  gene  was  constructed  by  replacing  the  Ntanl  exons  2-5  with  a 
neomycin-resistance  cassette  (Fig.  IB).  Exons  2-5  encoded  a  118-residue  region  of  the 
310-residue  NTANlp  (NtN-amidase)  that  was  significantly  conserved  among  the 
NTANlp  proteins  of  different  species,  including  the  plant  Arabidopsis  thaliana,  the 
fly  Drosophila  melanogaster,  and  mammals  (data  not  shown).  Of  the  -1,000 
embryonic  stem  (ES)  cell  clones  (strain  129/CJ7)  resistant  to  both  G418  and  FIAU, 

33  clones  contained  the  expected  mutation,  as  verified  by  PCR  and  Southern  analyses 
(data  not  shown).  Twelve  of  these  correctly  targeted  ES  cell  clones  were  used  to 

generate  male  chimeras,  and  in  six  of  them  the  Ntanl~  allele  was  transmitted  through 
germ  line.  Male  chimeras  were  mated  with  either  129/SvEv  or  C57BL/6  females, 

yielding  Ntanl+/~  heterozygotes.  Intercrosses  of  heterozygous  mice  produced 
Ntanl-/-  progeny  at  the  expected  frequency  of  approximately  1  in  4  (Figure  1C,  D), 
indicating  that  the  absence  of  NTANlp  did  not  increase  embryonic  lethality.  The 

behavioral  tests  described  below  utilized  exclusively  Ntanl V-  mice  produced  through 
matings  in  the  strain-129  background. 

Expression  of  Ntanl  and  intracellular  localization  of  NTANlp.  Several  regions 
of  Ntanl  transcripts  were  analyzed  using  RT-PCR  with  RNA  from  wildtype  and 
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Ntanl-f-  EF  cells.  The  level  of  Man  1 -derived  RNA  in  the  Ntanl~/~  EF  cells  was 
below  the  RT-PCR  detection  threshold  (Fig.  IE  and  data  not  shown).  Northern 
hybridization  analyses,  using  probes  encompassing  either  exons  2-5  (the  deleted 
region)  or  exons  1-5  detected  the  1.4  and  1.6  kb  Ntanl  mRNAs,  respectively,  in  the 
brains  and  testes  of  +/+  mice;  the  +/+  testes  also  contained  the  Ntanl-derived  RNAs 
of  1.1  and  4  kb  (Fig.  2A).  No  Ntanl  transcripts  were  detected  in  the  brains  and  testes  of 
Ntanl~ mice  with  these  probes  (Fig.  2Ab,  d). 

Similar  analyses  but  using  probes  encompassing  exons  1-10  or  2-10  (except  for 
the  IL2-similarity  region  of  exon  10  (24))  detected  at  most  trace  amounts  of 

Nfanl -derived  transcripts  in  the  brains  of  Ntanl -/-  mice  (Fig.  2Aa,  c).  However,  in 

the  testes  of  Ntanl  /  mice,  the  same  probes  detected  1.1  and  4  kb  Ntan  1  -derived, 
testis-specific  transcripts,  but  not  the  1.6  kb  transcript  (Fig.  2Aa,  c).  Further  mapping, 
using  the  probes  shown  in  Figure  2C,  indicated  that  the  1.1  and  4  kb  transcripts 
hybridized  to  Ntanl  exons  6-10  and  9-10,  respectively  (Fig.  2Ae,  f).  These  transcripts 
were  detected  with  antisense  but  not  with  sense  probes  (data  not  shown),  indicating 
the  direction  of  transcription  identical  to  that  for  the  full-length  Ntanl  mRNA.  Thus, 

the  testes  but  not  the  brains  of  Ntanl  -/-  mice  contained  a  set  of  Ntan  1 -derived 
transcripts  that  lacked  the  5 7 -ha If  of  the  Ntanl  ORF  (including  the  region  deleted  in 

Ntanl  /  mice)  and  encompassed  its  3'  half.  Apparently  the  same  transcripts  were 
present  in  the  +/+  testes  (Fig.  2Ae,  f).  Whether  these  testis-specific  transcripts  are 
physiologically  relevant  remains  to  be  determined. 

We  also  examined,  using  Northern  analysis,  whether  the  expression  of  other 

components  of  the  N-end  rule  pathway  was  altered  in  the  Ntanl~/~  mice.  No  changes 
in  the  expression  of  either  Ubr  1  (encoding  the  E3  of  the  N-end  rule  pathway)  or  Atel 
(encoding  R-transferase  (34);  see  Introduction)  were  detected  in  the  brains  and  testes 
of  Ntanl-/-  mice  (Fig.  2B).  The  expression  of  Ubr2  and  Ubr3  (homologs  of  Ubrl; 

Y.  T.  Kwon  and  A.  Varshavsky,  unpublished  data)  and  of  mHR6B,  encoding  the  E214K 

Ub-conjugating  (E2)  enzyme,  was  also  unchanged  in  the  Ntanl~/~  mice  (data  not 
shown).  In  addition,  no  alterations  in  the  ratio  of  Atel-1  mRNA  to  Atel-2  mRNA, 
which  encode  two  splicing-derived  forms  of  the  mouse  R-transferase  (34),  was 
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detected  in  the  brains,  testes,  and  EF  cells  of  Ntanl~/~  mice  (data  not  shown). 

Ntanl  was  expressed  in  most  if  not  all  tissues  of  adult  mice  (24).  Whole-mount 
in  situ  hybridization  was  used  to  examine  the  expression  of  Ntanl  in  +/+  mouse 
embryos.  In  e9.75  (9.75  days  old)  embryos,  Ntanl  was  most  strongly  expressed  in  the 
branchial  arches,  the  tail  bud  and  the  forelimb  buds;  the  same  probe  detected  no 

signal  in  the  Ntanl  embryos  (Fig.  3A).  In  el0.5  and  ell. 5  embryos,  the  expression 
of  Ntanl  became  high  in  the  hindlimb  buds  as  well;  this  expression  pattern  was 
indistinguishable  from  that  of  Ubrl  (Fig.  3 A),  consistent  with  the  assignment  of 
NTANlp  and  UBRlp  to  the  same  pathway. 

To  determine  the  intracellular  location  of  NTANlp,  it  was  expressed  in  mouse 
3T3  cells  as  a  fusion  to  the  N-terminus  of  Green  Fluorescent  Protein  (GFP).  Whereas 
the  free  26  kDa  GFP  was  distributed  uniformly  throughout  the  cell,  the  65  kDa 
NTANlp-GFP  was  enriched  in  the  nucleus,  was  also  present  in  the  nucleus-proximal 
cytoplasm,  but  was  apparently  much  less  abundant  at  the  cells'  periphery  (Fig.  3B  and 
data  not  shown). 

Ntanl  t  mice  lack  Nt^-amidase  and  the  asparagine-specific  branch  of  the 

N-end  rule  pathway.  To  determine  whether  Ntanl~ /-  EF  cells  lacked  the  NtN-amidase 
activity,  we  employed  a  previously  described  assay  (24).  The  35S-labeled  X-DHFR  test 
proteins  (X=Asn,  Gin,  or  Asp)  were  incubated  with  a  whole-cell  extract  and  thereafter 
fractionated  by  isoelectrofocusing  (IEF),  which  separated  Asn-DHFR  and  Gln-DHFR 
from  Asp-DHFR  and  Glu-DHFR,  respectively.  Incubation  of  Asn-DHFR  with  the 
extract  from  +/+  EF  cells  shifted  the  pi  of  this  protein  to  that  of  Asp-DHFR.  In 
contrast,  the  pi  of  Asn-DHFR  remained  unchanged  after  an  otherwise  identical 
incubation  with  the  same  amount  of  extract  from  Ntanl~/~  EF  cells  (Fig.  4A).  The  pi 
of  Gln-DHFR  shifted  to  that  of  Glu-DHFR  after  incubations  with  either  +/+  or 

Ntanl  /  extracts  (Fig.  4A).  Thus,  Ntanl~/~  EF  cells  lacked  the  NtN-amidase  activity, 
but  contained  wildtype  amounts  of  the  Gin-specific  NtQ-amidase.  The  latter  result 
indicated  that  the  putative  mammalian  NtQ-amidase  (which  remains  to  be  isolated 
and  cloned)  is  encoded  by  a  gene  distinct  from  Ntanl. 

To  examine  the  in  vivo  degradation  of  N-end  rule  substrates  in  the  absence  of 
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NtN-amidase,  the  immortalized  +/+  and  Ntanl~/~  EF  cell  lines  were  transiently- 
transfected  with  plasmids  that  expressed  X-nsP4pgal  test  proteins  (X=Met,  Asn,  Gin,  or 
Arg).  An  X-nsP4pgal  was  expressed  as  part  of  a  fusion  containing  the  ha-tagged 
reference  protein  dihydrofolate  reductase  (DHFR).  In  this  previously  developed  UPR 
(Ub /protein/reference)  technique  (40,  60),  a  DHFR-ha-UbR48-X-nsP4Pgal  fusion  is 
cotranslationally  cleaved  by  DUBs  at  the  Ub-X-nsP4Pgal  junction,  yielding  the 
long-lived  DHFR-ha-UbR48  reference  protein  and  the  test  protein  X-nsP4Pgal.  In  the 
UPR  technique,  the  reference  protein  serves  as  an  internal  control  for  the  levels  of 
expression,  immunoprecipitation  yields,  sample  volumes,  and  other  sources  of 
sample-to-sample  variation,  thereby  increasing  the  accuracy  of  pulse-chase  assays  (40, 
60). 

The  previously  introduced  term  IDX,  initial  decay,  i.  e.,  the  extent  of 
degradation  of  a  protein  during  the  pulse  of  x  minutes  (40),  was  used  to  describe  the 
in  vivo  decay  curves  of  test  proteins.  The  IDX  term  is  superfluous  in  the  case  of  a 
strictly  first-order  decay,  which  is  defined  by  a  single  half-life.  However,  the  in  vivo 
degradation  of  most  proteins  deviates  from  first-order  kinetics.  Specifically,  the  rate 
of  degradation  of  short-lived  proteins  can  be  much  higher  during  the  pulse,  in  part 
because  a  newly  labeled  (either  nascent  or  just-completed)  polypeptide  is 
conformationally  immature,  and  may,  consequently,  be  targeted  for  degradation  more 
efficiently  than  its  mature  counterpart.  This  enhanced  early  degradation,  previously 

termed  the  "zero-point"  effect  (5),  is  described  by  the  parameter  IDX  (40).  It  was  found 
that  a  large  fraction  of  the  zero-point  effect  results  from  the  cotranslational 
degradation  of  nascent  (being  synthesized)  polypeptides,  which  never  reach  their 
mature  size  before  their  destruction  by  processive  proteolysis  (G.  Turner  and 
A.  Varshavsky,  unpublished  data). 

Asn-nsP4Pgal  was  metabolically  unstable  in  +/+  EF  cells:  its  ID10  (percent 
degradation  during  the  10-min  pulse)  was  ~45%,  in  comparison  to  ID10  of  the 
long-lived  Met-nsP4(3gal,  taken  as  100%  (Fig.  4B,  D).  However,  the  same  Asn-nsP4Pgal 
was  completely  (and  reproducibly)  stabilized  in  the  Ntanl~/~  EF  cells,  its  pulse-chase 
pattern  becoming  indistinguishable  from  the  normally  long-lived  Met-nsP4Pgal.  In 
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contrast,  both  Gln-nsP4pgal  and  Arg-nsP4pgal  were  comparably  short-lived  in  both 
+/+  and  Ntanl'/'  cells  (Fig.  4B-D).  Taken  together  with  the  measurements  of 

NtN-amidase  enzymatic  activity  (Fig.  4A),  these  data  indicated  that  the  Ntanl"/"  EF 
cells,  in  contrast  to  congenic  wildtype  cells,  lacked  the  Asn-specific  branch  of  the 
N-end  rule  pathway  but  retained  the  rest  of  this  pathway. 

Motor  coordination  and  spontaneous  activity  of  Ntanl'f'  mice.  The  Ntanl'/' 
mice  were  fertile,  apparently  healthy,  of  similar  size  and  weight  as  +/+  littermates, 
and  cared  for  their  offspring.  These  mice  oriented  to  sound;  their  limb  movements 
and  behavior  appeared  to  be  indistinguishable  from  those  of  congenic  +/+  mice. 

Histological  examination  of  Ntanl"/"  tissues  (small  intestine,  liver,  pancreas,  adrenal 
gland,  thyroid  gland,  kidney,  ovary,  testis,  heart,  spleen,  thymus,  skeletal  muscle, 
brain,  and  sciatic  nerve)  did  not  detect  abnormalities.  No  significant  differences  were 

observed  between  +/+  and  Ntanl'/'  thymocytes  in  their  ability  to  undergo  apoptosis 

in  response  to  either  radiation  or  dexamethasone,  suggesting  that  Ntanl '/'  mice  were 
not  impaired  in  apoptotic  responses.  Thus  far,  the  only  non-behavioral  differences 

between  +/+  and  congenic  Ntanl'/"  mice  were  a  weaker  mitogenic  response  of 
Ntanl"/'  splenocytes  and  thymocytes  to  phytohemagglutinin,  and  hypersensitivity  of 
Ntanl'/'  mice  to  bacterial  lipopolysaccharide  (preliminary  data;  not  shown).  In 

addition,  over  the  last  1.5  years  the  frequency  of  natural  death  among  Ntanl'/'  mice 
was  -50%  higher  than  among  congenic  +/  +  mice. 

Motor  coordination  of  Ntanl'/'  and  congenic  +/+  mice  was  compared  using 
the  Rotarod  apparatus.  No  difference  in  the  ability  of  mice  to  stay  on  a  rotating 

horizontal  rod  was  observed  between  the  Ntanl'/"  and  +/+  strains  (Fig.  5A). 
Comparisons  of  the  physical  strength  (the  weight  retention  test),  of  both  coordination 
and  strength  (the  coat-hanger  test),  and  of  the  walking  patterns  (the  hindpaw 

footprint  test)  also  did  not  distinguish  between  Ntanl'/"  and  +/+  mice  (Figure  5B,  C 

and  data  not  shown).  These  and  related  observations  indicated  that  Ntanl"/"  mice 
were  not  impaired  in  either  motor  coordination,  physical  strength,  or  the  learning 
ability  required  for  their  improved  performance  in  repeated  trials  (Figure  A,  C). 
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The  locomotor  activity  of  Ntanl'/'  mice  was  assayed  in  the  open-field  test.  By 
several  criteria,  Ntanl'/'  mice  exhibited  significantly  lower  spontaneous  activity  than 

the  congenic  wildtype  mice  (Fig.  5F).  Specifically,  Ntanl ~l~  mice  traveled  significantly 
less  distance  (Fig.  5Fa),  spent  significantly  more  time  near  the  field's  center  (Fig.  5Fb), 
and  also,  consequently,  less  time  within  1  cm  from  the  walls  (Fig.  5Fc).  These  findings 
will  be  considered  in  conjunction  with  data  from  the  platform-leaving  test  (see  Fig.  6 
and  Discussion). 

Active  and  passive  avoidance  learning.  The  shuttlebox  avoidance  test 

compared  shock-motivated  learning  in  +/+  and  congenic  Ntanl'/'  mice.  In  this 
active  avoidance  learning  procedure,  a  light-conditioned  stimulus  (CS)  precedes,  by 
5  sec,  the  unconditioned  stimulus  (UCS)  of  electric  shock,  which  remains  on  for 
20  sec.  The  animal  can  (i)  cross  to  the  safe  compartment  within  the  5-second  CS 
interval,  thus  avoiding  shock;  (ii)  cross  over  during  the  20-second  UCS  interval,  to 
escape  the  ongoing  shock;  or  (iii)  not  cross  over  and  receive  the  full  20  sec  of  shock 

(the  latter  outcome  is  scored  as  the  null  response).  Neither  +/+  nor  Ntanl "/-  mice 
showed  any  evidence  of  learning  over  the  5  days  of  testing.  Flowever,  there  were 
performance  differences  in  the  distribution  of  their  responses.  The  two  groups 
averaged  only  4  avoidance  responses  per  day,  but  the  +/+  mice  exhibited  more  escape 

responses  and  therefore  fewer  null  responses  than  the  Ntanl'/'  mice  (Fig.  5D). 

Neither  group  showed  evidence  of  learning  during  the  5  days  of  retention  testing,  but 

the  number  of  null  responses  by  Ntanl'/'  mice  decreased  significantly  in  comparison 
to  their  performance  during  learning  (data  not  shown),  indicating  that  memory 
consolidation  had  occurred  in  this  group  between  the  original  testing  and  retesting  7- 
10  weeks  later.  Strain-129  mouse  substrains  are  known  to  perform  poorly  in  active 
avoidance  tests  (7,  53). 

In  the  above  shuttlebox  text,  an  electric  shock  is  used  to  compel  the  animal  to 
make  an  active  response.  By  contrast,  in  passive  learning,  the  shock  is  used  to  compel 
the  animal  to  inhibit  its  behavior.  The  mouse  was  placed  into  a  chamber.  Ten 
seconds  later,  the  chamber  was  illuminated  and  a  guillotine  door  opened,  exposing  a 
dark  chamber.  The  natural  behavior  is  to  move  into  the  dark.  When  this  occurred. 
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the  guillotine  door  closed  and  the  animal  received  0.4  mA  of  footshock  for  2  seconds. 
The  time  it  took  the  mouse  to  enter  the  dark  chamber  (until  the  cutoff  time  of  400  sec) 
was  recorded.  The  mouse  was  put  back  in  its  home  cage;  24  hr  later,  the  mouse  was 
placed  back  into  the  original  chamber,  and  all  stimulus  conditions  were  repeated 
except  that  no  shock  occurred  when  the  mouse  entered  the  dark  chamber.  The  usual 
finding  is  that  the  mouse  takes  longer  to  enter  the  dark  chamber  on  the  second  day 
because  the  prior  shock  experience  inhibits  its  behavior.  Figure  5E  shows  the  time 
before  entry  into  the  dark  chamber  ("latency  time")  on  both  days  for  the  two 

genotypes.  On  the  first  day,  the  Ntanl~/~  mice  took  significantly  longer  to  enter  the 
dark  chamber  than  the  +/+  animals.  On  the  second  day,  the  entry  into  the  dark 

chamber  by  the  Ntanl~/~  mice  was  further  delayed  (indicating  learning),  while  the 
entry  by  the  +/+  mice  was  delayed  even  more,  resulting  in  similar  retention  times  for 
either  genotype  (Fig.  5E). 

The  longer  retention  time  of  the  Ntanl~/~  mice  on  the  first  day  of  passive 
learning,  as  well  as  their  lower  activity  in  the  open-field  test  suggested  that  +/+  and 

congenic  Ntanl~/~  mice  might  differ  in  their  anxiety  levels.  To  investigate  this,  they 
were  tested  in  the  elevated  plus  maze,  which  consists  of  two  open  arms  and  two 
closed  arms,  which  radiate  from  a  center  platform.  The  open  arms  are  more 
anxiety-inducing,  in  that  the  animals  with  higher  anxiety  tend  to  decrease  exploration 

of  the  open  arms  (42).  Although  the  Ntanl~/~  mice  tended  to  explore  the  open  arms 
less  frequently  than  congenic  +/+  mice,  the  observed  difference  was  not  statistically 
significant  (data  not  shown). 

A  socially  conditioned  exploratory  phenotype  of  Ntanfl-  mice.  To  address  the 

finding  of  lower  activity  in  the  Ntanl~/~  mice  (Fig.  5F)  in  a  different  way,  we  devised  a 
simple  test  in  which  the  exploratory  activity  was  assessed  in  the  context  of  a  social 

interaction.  In  this  "platform-leaving"  test,  two  previously  untested  mice,  Ntanl -/- 
and  a  congenic  +/+,  are  placed,  close  together  and  at  the  same  time,  onto  the  center  of 
a  16  x  22  cm  platform  2.7  cm  in  height.  Thereafter  one  records  the  following  variables: 
the  time  it  takes  each  mouse  to  leave  the  platform,  until  the  cutoff  time  of  3  min;  the 
number  and  genotype  of  mice  leaving  the  platform  first  (in  independent  trials  with 
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pairs  of  mice);  and  the  number  and  genotypes  of  mice  not  leaving  the  platform  by  the 
cutoff  time  (see  Materials  and  Methods).  In  contrast  to  the  standard  open-field  test, 
a  mouse  in  the  platform-leaving  test  sees  and  smells  the  second  mouse  on  the  same 
platform,  and  is  otherwise  influenced  by  the  second  mouse  in  the  course  of  their 
peregrinations  over  the  platform.  Thus  the  proclivity  of  a  mouse  to  explore  its 
environment  is  modulated,  in  the  platform-leaving  test,  by  interactions  with  another 
mouse. 

A  striking  result  of  this  test  was  the  observation  that  an  Ntanl'/'  mouse  paired 
with  its  congenic  +/+  littermate  tended  to  stay  longer  on  the  platform,  and  almost 
never  left  it  by  the  cutoff  time,  in  contrast  to  the  +/+  mouse  (Fig.  6A).  For  example,  in 

a  trial  with  44  pairs  of  previously  untested  Ntanl'/'  and  +/+  mice,  only  1  Ntanl -/- 
mouse  left  the  platform  by  the  cutoff  time,  whereas  20  +/+  mice  left  it  by  that  time 
(Fig.  6Ac).  Repeated  trials,  at  1-hour  intervals,  in  the  platform-leaving  test 

progressively  decreased  this  initial  difference  between  the  paired  Ntanl'/'  and  +/+ 
mice  (Fig.  6A).  Note  that  this  decrease  resulted  from  both  an  increase  in  the 

exploratory  activity  of  Ntanl'/'  mice  (in  successive  trials)  and  a  decrease  in  the 
exploratory  activity  of  their  paired  +/+  littermates  (e.  g..  Fig.  6Ac).  Thus,  the  strongly 

different  initial  responses  of  paired  Ntanl'/'  and  +/+  mice  to  the  novel  environment 
(of  which  the  other  mouse  was  a  part)  became  attenuated  in  subsequent  trials  (Fig. 

6A),  and  nearly  disappeared  by  the  7th  trial  (data  not  shown). 

We  then  asked  whether  the  observed  difference  between  Ntanl'/'  and  +/+ 
mice  was  retained  if  the  same  test  was  performed  with  congenic  non-littermates . 
Strikingly,  if  the  test  of  Figure  6A  was  carried  out  with  wildtype  and  mutant 
non-littermates,  the  results  were  reproducibly  different  from  those  with  littermates 

(Fig.  6B).  Specifically,  in  this  case  Ntanl'/'  mice  did  not  stay  longer  on  the  platform: 
they  left  it,  on  average,  more  frequently  than  +/+  mice,  a  pattern  opposite  to  that  seen 
with  littermates  (Fig.  6Bb;  compare  with  Fig.  6Ab).  In  addition,  the  striking  difference 

between  Ntanl'/'  and  +/+  littermates  in  their  frequency  of  not  leaving  the  platform 
by  the  cutoff  time  was  no  longer  seen  in  the  otherwise  identical  tests  with 
non-littermates  (Fig.  6Ac;  compare  with  Fig.  6Bc). 
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In  the  control  experiments,  multiple  pairs  of  littermates  of  the  same  genotype 

(either  both  +/  +  or  both  Ntanl~/~)  were  subjected  to  this  test.  No  statistically 
significant  differences  between  the  platform-leaving  patterns  of  the  two  mice  in  a  pair 
were  observed  (data  not  shown).  The  results  described  in  Figure  6  were  consistently 
reproduced  in  4  independent  sets  of  tests,  carried  out  over  6  months  with  previously 
untested  mice  (at  least  10  pairs  of  mice  per  test).  This  socially  conditioned  exploratory 

phenotype  of  Ntanl~/~  mice  (Fig.  6)  has  not  been  previously  described,  to  our 
knowledge,  with  other  mouse  strains. 

Spatial  learning  and  memory.  To  compare  spatial  learning  of  +/+  and 

congenic  Ntanl~/~  mice,  we  tested  them  on  the  hidden  platform  Morris  maze  (where 
a  mouse  had  to  find  a  platform  submerged  in  water  pool),  using  the  "reversal 
learning"  regimen  (see  Materials  and  Methods)  and  re-testing  7  or  8  weeks  later.  Both 

+/+  and  Ntanl~/~  mice  learned  to  find  the  platform,  and  did  not  differ  in  either  the 
time  they  took  to  do  it  or  the  distance  traveled;  they  also  did  not  differ  on  these 
measures  in  the  subsequent  reversal-learning  test,  when  the  platform  was  moved  to 
the  diagonally  opposite  quadrant  (data  not  shown).  However,  7-8  weeks  later,  on  the 

first  day  of  retention  testing,  the  Ntanl~/~  mice  were  inferior  to  congenic  +/+  mice,  in 
that  they  took  significantly  longer  time  to  find  the  platform,  and  traveled  a 
significantly  longer  distance  before  reaching  it  (data  not  shown),  suggesting  that  the 
Ntanl~/~  mice  have  a  less  effective  spatial  memory  system. 

Another  way  to  study  spatial  learning  employs  the  water  version  of  the  radial 
arm  maze,  where  4  of  the  8  arms  contain  escape  platforms  (30).  There  are  no  intra¬ 
maze  cues  available  to  the  mouse,  and  it  must  use  extra-maze  spatial  information 
involving  reference  and  working  memory  to  remember  which  alleys  to  enter  and 
which  to  avoid.  The  mice  were  given  one  training  day  and  were  tested  for  11 
consecutive  days,  receiving  4  trials  each  day.  The  measure  used  was  the  total  number 

of  errors.  Ntanl~/~  mice  made  significantly  more  errors  in  the  acquisition  phase 
(sessions  2-7)  than  the  congenic  +/+  mice  (Fig.  7A).  In  the  asymptotic  phase  (sessions 
8-12),  there  were  no  significant  differences  between  the  two  strains  (Fig.  7A). 
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The  4  trials  within  each  daily  session  impose  an  increasing  demand  on 
memory.  Thus,  on  the  first  trial,  there  are  platforms  in  4  of  the  8  alleys,  and  entering 
any  of  these  terminates  the  trial.  On  the  second  trial,  3  of  the  alleys  still  contain 
platforms;  the  mouse  has  to  remember  which  alley  it  had  entered  on  the  prior  trial, 
and  not  enter  that  alley  again.  Finally,  on  the  fourth  trial,  the  mouse  has  to 
remember  which  three  alleys  it  had  chosen  earlier,  so  that  it  can  find  the  one 
remaining  alley  containing  a  platform.  Thus,  by  studying  the  error  rate  from  trial  to 
trial,  summed  over  the  sessions,  one  can  compare  the  ability  of  mice  to  cope  with  an 
increasing  memory  load.  Figure  7C  shows  the  error  curves  summed  over  the 
acquisition  phase.  As  expected,  errors  increased  as  the  memory  load  increased. 

However,  the  rate  of  increase  was  significantly  greater  for  the  Ntanl~/~  mice, 
especially  on  Trial  4  (Fig.  7C). 

Unlike  previously  tested  mouse  strains  (30),  both  +/+  and  congenic  Ntanl~/~ 
mice  exhibited  poor  learning  over  the  11  days  of  testing,  consistent  with  inferior 
performance  of  strain  129-derived  mice  in  several  other  learning  regimens  (7,  53). 
Thus,  the  significant  effects  described  above  (Fig.  7A,  C)  represent  largely  differences 
in  performance,  but  say  little  about  the  spatial  learning  ability  of  these  mice.  To 
determine  whether  the  observed  effects  were  related  to  spatial  competence,  the 
identical  experiment  was  carried  out  using  the  same  radial-arm  maze,  except  that  a 
non-spatial  version  was  devised,  by  having  different  visual  patterns  in  each  alley  and 
rotating  the  maze  between  trials  so  that  the  extra-maze  cues  were  rendered 
meaningless.  In  4  of  the  arms,  escape  platforms  were  placed,  and  the  mice  had  to 
learn  to  associate  a  specific  visual  configuration  (e.g.,  black  dots  on  a  white 
background,  or  horizontal  black  and  white  stripes)  with  the  presence  or  absence  of  a 
platform  (29).  In  contrast  to  the  results  with  the  spatial  radial-arm  maze,  no 
significant  differences  between  the  two  genotypes  were  observed  in  the  non-spatial 
radial-arm  maze,  either  in  regard  to  the  overall  number  of  errors  (Fig.  7B)  or  in  regard 
to  the  error  rate  as  a  function  of  memory  load  (data  not  shown).  Here,  too,  both  +/+ 

and  congenic  Ntanl~/~  mice  did  not  exhibit  significant  learning  over  the  11  days  of 
testing.  Thus,  no  conclusions  can  be  drawn  about  spatial  or  non-spatial  learning.  At 
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the  same  time,  it  was  possible  to  conclude  that  Ntanl'/~  mice  were  less  effective  in 
dealing  with  spatial  information  than  the  congenic  +/+  mice. 

Another  test  of  spatial  learning  and  memory  was  the  Lashley  III  maze.  This 
maze  contains  cul-de-sacs  that  an  animal  has  to  learn  to  avoid,  and  T-choices  that 

require  the  animal  to  make  the  correct  left  or  right  turn.  Both  +/+  and  Ntanl~/~  mice 

were  able  to  learn  the  maze,  but  Ntanl~/~  mice  were  significantly  less  effective,  as 
measured  by  the  learning  index,  T-forward  errors  (making  a  wrong  T-choice),  and  the 

total  number  of  backward  errors  (Fig.  7D).  When  retested  7  or  8  weeks  later,  Ntanl~l~ 
mice  had  lower  scores  on  the  first  trial  on  all  three  measures,  showing  inferior 
long-term  retention  (data  not  shown).  These  results  were  similar  to  the  findings  with 
the  Morris  maze  (see  above),  except  that  there  was  a  significant  difference  in  the 
original  learning  of  the  Lashley  maze  favoring  the  +/+  mice  (Fig.  7D).  This  difference 
may  stem  from  the  fact  that  the  Lashley  maze  can  be  learned  using  intra-maze  cues  to 
memorize  the  pathway  of  left  and  right  turns,  in  addition  to  the  use  of  extra-maze 
cues,  which  are  present  in  both  the  Lashley  and  Morris  maze  tests.  The  results  of  the 

two  radial-arm  studies  (Fig.  7A-C)  indicated  that  the  Ntanl~/~  mice  used  spatial 
information  less  effectively  than  the  +/+  mice.  In  summary,  the  results  of  spatial 
memory  tests  (Fig.  7)  strongly  suggested  that  mouse  NtN-amidase  contributes  to  the 
processing  of  spatial  information  and  to  long-term  retention  of  spatial  learning. 

Discussion 

Homologs  of  the  310-residue  mammalian  NtN-amidase,  encoded  by  the  Ntanl 
gene  (24,  59),  are  present  in  other  vertebrates,  in  arthropods  such  as  D.  melanogaster, 
and  in  plants  such  as  A.  thaliana,  but  are  apparently  absent  from  the  nematode 
C.  elegans  (data  not  shown).  The  deamidation  of  N-terminal  (and  apparently  only 
N-terminal)  Asn  residue  in  proteins  or  short  peptides  is  the  only  known  enzymatic 
activity  of  NtN-amidase.  This  enzyme  was  operationally  defined  as  a  component  of 
the  N-end  rule  pathway,  because  it  converts,  in  vivo,  the  N-terminal  Asn  of 
engineered  N-end  rule  substrates  into  N-terminal  Asp,  which  is  then  conjugated  by 
R-transferase  to  Arg,  a  primary  destabilizing  residue  (34,  63).  Physiological  substrates 
of  NtN-amidase  remain  to  be  identified.  Physiological  substrates  are  also  unknown 
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for  the  fungal  Nt-amidase,  which  can  deamidate  either  N-terminal  Asn  or 
N-terminal  Gin,  and  lacks  significant  sequence  similarities  to  metazoan 
NtN-amidase  (6).  In  the  present  work,  mouse  NtN-amidase  was  studied  using  several 
approaches,  including  targeted  mutagenesis.  We  report  the  following  results: 

(i)  Both  copies  of  the  mouse  Ntanl  gene,  which  encodes  the  Asn-specific 
N-terminal  amidohydrolase  (NtN-amidase)  were  replaced  by  a  deletion /disruption 

allele.  The  resulting  Ntanl'/'  mice  were  viable  and  fertile.  Their  behavioral 
phenotypes  are  described  below. 

(ii)  Embryonic  fibroblast  (EF)  cells  from  Ntanl'/'  mice  lacked  the  NtN-amidase 
activity,  in  contrast  to  congenic  +/  +  EF  cells.  In  addition,  among  the  normally 
short-lived  substrates  of  the  N-end  rule  pathway,  only  those  bearing  N-terminal  Asn, 
a  tertiary  destabilizing  residue  (see  Introduction),  became  long-lived  in  the  Ntanl'/' 

EF  cells.  Thus,  at  least  these  cells,  and,  by  inference,  the  Ntanl'/'  mice,  lacked  the 
asparagine  branch  of  the  N-end  rule  pathway. 

(iii)  The  Ntanl'/'  EF  cells  contained  wildtype  levels  of  Gin-specific  Nt-amidase 
(NtQ-amidase),  which  mediates  the  activity  of  N-terminal  Gin,  another  tertiary 
destabilizing  residue  in  the  N-end  rule.  Thus,  NtN-amidase  and  NtQ-amidase  (the 
latter  enzyme  remains  to  be  isolated  and  characterized)  are  encoded  by  different  genes. 

(iv)  The  brains  and  testes  of  +/+  mice  contained,  respectively,  the  1.4  and  1.6  kb 
Ntanl  mRNAs.  In  addition,  the  testes,  but  not  the  brains  of  +/+  mice  also  contained 

the  Ntanl -derived  RNAs  of  1.1  and  4  kb.  The  brains  and  testes  of  Ntanl'/'  mice 
lacked,  respectively,  the  1.4  and  1.6  kb  Ntanl  mRNAs.  However,  the  testes  of 

Ntanl'/'  mice  retained  the  testis-specific  1.1  and  4  kb  Ntanl-de rived  RNAs,  which 
were  found  to  lack  the  5,-half  of  the  Ntanl  ORF  (including  the  region  deleted  in 

Ntanl'/'  mice),  and  to  encompass  the  3'  half  of  Ntanl  ORF.  Whether  these 
testis-specific  transcripts  are  physiologically  relevant  in  +/+  mice  remains  to  be 
determined. 

(v)  Deletion /disruption  of  Ntanl  in  the  Ntanl'/'  mice  did  not  significantly 
change  the  levels  of  mRNAs  encoding  other  components  of  the  N-end  rule  pathway. 
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such  as  ATElp  (R-transferase);  UBRlp  (E3a,  the  pathway's  E3  component);  UBR2p 
(a  homolog  of  UBRlp);  and  mHR6Bp  (the  E214K  Ub-conjugating  (E2)  enzyme). 

(vi)  Ntanl,  which  is  expressed,  at  different  levels,  in  most  if  not  all  tissues  of 
adult  mice  (24),  was  found,  through  in  situ  hybridization,  to  be  particularly  strongly 
expressed  in  the  branchial  arches,  the  tail  bud  and  the  limb  buds  of  10  days  old 
embryos.  This  expression  pattern  was  indistinguishable  from  that  of  Ubrl  (36), 
consistent  with  the  assignment  of  NTANlp  (NtN-amidase)  and  UBRlp  (E3a)  to  the 
same  pathway. 

(vii)  An  NTANlp-GFP  fusion  was  enriched  in  the  nucleus  of  transfected 
mouse  3T3  cells,  in  contrast  to  free  GFP,  which  was  uniformly  distributed. 
NTANlp-GFP  was  also  present  in  the  nucleus-proximal  cytoplasm,  but  was 
apparently  much  less  abundant  at  the  cells'  periphery.  This  localization  pattern  of  the 
mammalian  NtN-amidase  is  quite  different  from  that  of  the  S.  cerevisiae 

NTA1  -encoded  Nt-amidase,  most  of  which  is  located  in  the  yeast  mitochondria 
(H.  R.  Wang  and  A.  Varshavsky,  unpublished  data).  In  contrast  to  the  mammalian 
NtN-amidase,  the  yeast  Nt-amidase,  which  has  no  significant  sequence  similarities  to 
the  metazoan  enzyme,  can  deamidate  either  N-terminal  Asn  or  N-terminal  Gin  (6). 

(viii)  While  the  abnormal  phenotypes  of  Ntanl~l~  mice  are  apparently  not 
confined  to  behavioral /cognitive  alterations  (see  Results),  it  is  the  latter  that  have 

been  analyzed,  thus  far,  in  some  detail.  Briefly,  the  Ntanl~/~  mice  were 
indistinguishable  from  congenic  +/+  mice  in  motor  coordination  and  general 
physical  performance,  but  had  reduced  spontaneous  activity  and  less  effective  spatial 

memory.  Remarkably,  the  exploratory  behavior  of  Ntanl~/~  mice  was  found  to  be 
particularly  different  from  that  of  congenic  +/+  mice  in  the  presence  of  social 

interactions.  If  a  previously  untested  Ntanl~/~  mouse  was  placed  on  a  small,  slightly 
elevated  platform  with  its  +/+  littermate  (which  it  grew  up  with  in  the  same  cage), 

the  Ntanl~/~  mouse  left  the  platform  much  more  slowly  than  the  +/+  mouse.  In 

contrast,  when  an  Ntanl~/~  mouse  was  subjected  to  this  test  in  the  presence  of  a 
congenic  non-littermate  +/+  mouse,  it  tended  to  leave  the  platform  faster  than  the 
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+/+  mouse,  in  spite  of  the  fact  that  Ntanl  /'  mice  exhibited  diminished  exploratory 
activity  when  tested  alone  (Fig.  6). 

Our  working  hypothesis  is  that  Ntanl -/-  mice  are  socially  recessive,  in  relation 
to  congenic  +/  +  mice,  and  in  addition  have  a  lower  spontaneous  exploratory  activity. 
In  a  relatively  unstressful  setting,  such  as  when  it  is  placed  on  the  platform 

side-by-side  with  a  (familiar)  littermate,  the  behavior  of  Ntanl' /'  mice  is  governed 
largely  by  their  inherently  lower  exploratory  activity,  yielding  the  observation  that 
these  mice  leave  the  platform  much  more  slowly  than  their  +/+  littermates  (Fig.  6A). 
By  contrast,  in  an  otherwise  identical  test  with  a  +/+  non-littermate,  the  postulated 

social  recessiveness  of  Ntanl'!'  mice  becomes  their  behavior-governing  trait.  As  a 
result,  they  leave  the  platform  faster  than  the  +/+  non-littermates,  to  reduce  the 
proximity-induced  anxiety  (Fig.  6B). 

The  socially  conditioned  exploratory  phenotype  of  Ntanl'/'  mice,  identified 
through  the  platform-leaving  test  (Fig.  6),  has  not  been  previously  described,  to  our 
knowledge,  with  other  mouse  strains.  It  is  difficult  to  compare  this  finding  with 
behavioral  studies  of  other  mouse  mutants,  because  most  of  the  earlier  analyses  of 
exploratory  behavior  employed  solitary  mice.  An  example  of  differential  effect  of 
stress  on  a  specific  mutant  is  provided  by  the  analysis  of  mice  lacking  the  serotonin 
receptor  1A.  These  mice  exhibited  decreased  exploratory  activity  and  increased  fear  of 
aversive  environments  (in  comparison  to  +/+  mice),  as  measured  by  the  open  field 
test  and  the  elevated  plus-maze  test,  respectively.  However,  the  mutant  mice  were 
indistinguishable  from  +/+  littermates  in  a  less  stressful  setting  such  as  their  home 
cages  (50). 

A  parsimonious  molecular  interpretation  of  our  results  (Figs.  5-7)  is  that  a 
normally  short-lived  regulatory  protein(s)  that  is  targeted  for  degradation  by  the 
N-end  rule  pathway  through  the  protein's  N-terminal  Asn  becomes  long-lived  in 
Ntanl'/-  mice.  The  resulting  increase  in  the  steady-state  concentration  of  this 
protein(s)  alters  the  functioning  of  relevant  neural  networks  in  the  adult  brain  or, 
non-alternatively,  changes  these  networks  in  the  course  of  their  establishment  during 
development.  That  the  inactivation  of  a  Ub-dependent  proteolytic  pathway  could 
affect  cognitive  functions  is  illustrated,  for  example,  by  the  identification  of  UBE3A  as 
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a  human  gene  encoding  an  E3  protein  of  the  Ub  system  called  E6AP.  This  gene  is 
mutated  in  the  Angelman's  syndrome,  the  symptoms  of  which  include  motor 
disfunction  and  mental  retardation  (33,  45).  Mice  lacking  UBE3A  exhibit  deficits  in 
contextual  learning  and  long-term  potentiation  (31). 

The  absence  of  severe  impairments  in  the  Ntanl~l~  mice  should  make  them 
particularly  suitable  as  vehicles  for  the  approaches  to  conditional  mutagenesis  that 
utilize  conditional  destabilization  of  a  protein  of  interest.  One  version  of  this 
approach  could  utilize  a  transgenic  mouse  strain  that  lacks  endogenous  Ntanl  and 
expresses  this  gene  from  one  of  the  previously  constructed  promoters  whose  activity 
can  be  controlled  by  small  molecules  such  as  tetracycline  or  ecdysone  (9).  If  a  protein 
of  interest  is  modified,  through  knock-in  mutagenesis  and  the  Ub  fusion  technique 
(63),  to  bear  an  Asn-containing  N-degron,  the  resulting  protein  would  be  long-lived 
in  the  absence  of  the  Ntanl-encoded  NtN-amidase  but  short-lived,  and  therefore 
scarce,  in  the  presence  of  NtN-amidase.  In  this  method,  the  steady-state  level,  and 
hence  the  activity  of  a  protein  of  interest,  is  controlled  through  Ntanl-dependent, 
regulated  changes  of  the  protein's  metabolic  stability.  One  advantage  of  this  strategy  is 
that  it  does  not  require  alterations  of  a  promoter  that  expresses  a  gene  of  interest, 
thereby  avoiding  potential  perturbations  of  the  promoter  regulation  during 
development  and  differentiation.  This  approach  may  be  combined  with  the  existing 
technologies  for  conditional  mutagenesis  of  the  mouse  (13,  55),  enhancing  them 
through  regulated  degradation  of  a  protein  of  interest. 

Inasmuch  as  behavior  is  an  emergent  property  of  highly  complex  neural 
networks  interacting  with  the  musculoskeletal  apparatus,  a  behavioral  alteration  that 
is  caused  by  the  absence  of  a  specific  protein  from  the  brain  is  difficult  to  understand 
in  terms  of  higher-order  neural  events  even  in  the  case  of  a  protein  (e.  g., 
a  neurotransmitter-gated  ion  channel)  whose  "local"  molecular  function  in  the 
individual  neurons  is  clear.  The  difficulty  is  further  increased  in  the  case  of  a  protein 
such  as  NtN-amidase,  whose  physiological  substrates  remain  to  be  identified.  At  the 
same  time,  the  absence  of  a  priori  reasons  to  suspect  a  specific  role  for  NtN-amidase  in 
the  brain's  functions,  and  the  robustness  of  behavioral  differences  between  the 

Ntanl~/~  and  congenic  +/+  mice  (Figs.  5-7)  make  these  findings  particularly 
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intriguing.  Further  advances  in  this  inquiry  will  require,  at  minimum,  the 

identification  of  physiological  substrates  of  NtN-amidase.  The  availability  of  Ntanl~ A 
mice  and  cell  lines  derived  from  them  should  facilitate  the  discovery  of  these 
substrates. 
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Footnote 

^Throughout  the  text,  the  names  of  mouse  genes  are  in  italics,  with  the  first 
letter  upper-case.  The  names  of  human  and  Saccharomyces  cerevisiae  genes  are  also 
in  italics,  all  upper-case.  If  human  and  mouse  genes  are  named  in  the  same  sentence, 
the  mouse  gene  notation  is  used.  The  names  of  S.  cerevisiae  proteins  are  Roman, 
with  the  first  letter  upper-case  and  an  extra  lower-case  "p"  at  the  end.  The  names  of 
mouse  and  human  proteins  are  the  same,  except  that  all  letters  but  the  last  "p"  are 
upper-case.  The  latter  usage  is  a  modification  of  the  existing  convention  (58),  to 
facilitate  simultaneous  discussions  of  yeast,  mouse  and  human  proteins.  In  some 
citations,  the  abbreviated  name  of  a  species  precedes  the  gene's  name.) 
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Legends  to  Figures 

figure  1.  Deletion/disruption  of  the  mouse  Ntanl  gene.  (A)  Comparison  of 
enzymatic  reactions  that  underlie  the  activity  of  tertiary  and  secondary  destabilizing 
residues  in  the  yeast  S.  cerevisiae  and  the  mouse.  N-terminal  residues  are  indicated 
by  single-letter  abbreviations  for  amino  acids.  The  ovals  denote  the  rest  of  a  protein 
substrate.  The  Ntanl -encoded  mammalian  NtN-amidase  converts  N-terminal  Asn 
into  Asp.  N-terminal  Gin  is  deamidated  by  NtQ-amidase,  which  remains  to  be 
isolated  (see  the  main  text).  In  contrast,  the  yeast  Nt-amidase  Ntalp  can  deamidate 
either  N-terminal  Asn  or  Gin  (6).  The  secondary  destabilizing  residues  Asp  and  Glu 
are  arginylated  by  the  mammalian  ATEl-lp  or  ATEl-2p  R-transferases  (34). 

A  Cys-specific  mammalian  R-transferase  (23)  remains  to  be  identified.  N-terminal 
Arg,  one  of  the  primary  destabilizing  residues,  is  recognized  by  N-recognin,  the  E3 
component  of  the  N-end  rule  pathway  (63).  (B)  The  targeting  strategy.  Top,  a  partial 
restriction  map  of  the  mouse  Ntanl  gene;  middle,  the  structure  of  the  targeting 

vector;  bottom,  the  structure  of  the  deletion/disruption  Ntanl'  allele.  Exons  are 
denoted  by  solid  vertical  bars.  The  directions  of  transcription  of  the  neomycin  ( neo ) 
and  the  thymidine  kinase  ( tk )  genes  are  indicated.  Homologous  recombination 
resulted  in  the  replacement  of  the  Ntanl  exons  2-5  with  the  neo  cassette.  Probes  for 
Southern  hybridization  are  indicated  by  solid  rectangles.  Restriction  sites:  Xh,  Xhol ; 
R,  EcoRI;  BI,  BamHI;  H,  Hzzzdffl;  P,  Pstl.  (C)  Southern  analysis  of  BzzzzzHI-digested 

tail  DNA  from  wildtype  (+/+),  heterozygous  ( Ntanl+ /-),  and  Ntanl'/'  mice.  The 
5'-probe  yielded  the  12  and  1.7  kb  Ntanl  fragments  for  the  wildtype  and  mutant  Ntanl 
alleles,  respectively;  the  3'-probe  detected  12  and  7  kb  fragments.  The  organization  of 
the  deletion/ disruption  allele  was  independently  verified  by  Southern  analysis  of  the 
X/zoI-Hzndlll-digested  tail  DNA  (data  not  shown).  (D)  PCR  analysis  of  tail  DNA.  The 
primers  were  5'-GCCACTTGTGTAGCGCCAAGTGCCAGC  (for  neo,  forward), 
S'-CTTCCCACCAAGCCTGACTGTTGATC  (for  Ntanl,  forward)  and 
5'~CTTCAATTTCTGTGCTCAGCTAAGCTC  (for  Ntanl,  reverse).  (E)  RT-PCR  analysis 

of  the  total  RNA  isolated  from  +/+  and  Ntanl'/'  EF  cells,  using  primers  PI  (for 
exon  1),  P2  (exon  2),  P3  (exon  6),  P4  (exon  5),  and  P5  (exon  10).  p-actin  mRNA  was 
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used  as  a  control,  at  the  20-fold  lower  primer  concentration  in  comparison  to  other 
lanes. 

figure  2.  Northern  hybridization  analysis  of  the  total  RNA  isolated  from  +/  +  and 

Ntanl'/'  brains  and  testes.  (Aa-Af)  Hybridization  using  probes  a-f  that  encompassed 
different  regions  of  the  Ntanl  cDNA  (indicated  in  C).  (Ba-Bc)  The  same  Northern 
blots  were  hybridized  with  probes  specific,  respectively,  for  the  mouse  Ubrl,  Atel  and 
P-actin  cDNAs.  (C-)  Exons  of  Ntanl  and  the  hybridization  probes  used.  The  sequence 
of  a  208  bp  segment  of  the  Ntanl  cDNA  (nt  896-930),  termed  the  IL2  homology  region, 
is  98.6%  identical  to  the  sequence  of  a  206  bp  segment  in  the  3'-flanking  untranslated 
region  of  the  mouse  112  gene  that  encodes  interleukin-2  (IL-2)  (24). 

figure  3.  Expression  of  Ntanl  mRNA  and  localization  of  NTANlp. 

(A)  Whole-mount  in  situ  hybridization  of  wildtype  and  Ntanl' /'  embryos  (left  4 
panels,  light  background)  with  an  antisense  RNA  probe  derived  from  a  0.3  kb 

fragment  of  the  Ntanl  cDNA  (nt  108-448)  that  was  absent  from  the  Ntanl'/'  allele. 
The  regions  of  high  Ntanl  expression  in  the  tail  buds  (t),  forelimb  buds  (fl),  and 
hindlimb  buds  (hi)  are  indicated.  The  right  two  panels  (dark  background)  show  the 
results  of  in  situ  hybridization  with  an  antisense  (AS)  Ubrl  cDNA  probe. 

(B)  Intracellular  localization  of  mouse  NTANlp.  NIH-3T3  cells  were  transiently 
transfected  with  a  plasmid  expressing  the  NTANlp-GFP  fusion  protein  (see  Materials 
and  methods).  Typical  fluorescence  patterns  and  the  matching  phase  contrast  images 
are  shown,  respectively,  in  subpanels  a,  c,  e  and  b,  d,  f  (see  the  main  text). 

figure  4.  Mouse  Ntanl'/'  cells  lack  NtN-amidase  and  are  unable  to  degrade  the 
normally  short-lived  N-end  rule  substrates  bearing  N-terminal  Asn.  (A)  35S-labeled, 
purified,  X-DHFR  test  proteins  (X=Asn,  Gin  or  Asp)  were  incubated  for  2  hr  at  37°C 
with  buffer  alone  (negative  controls,  NC)  or  with  extracts  from  either  wildtype  or 

Ntanl'/'  embryonic  fibroblasts  (EFs),  followed  by  isoelectric  focusing  (IEF)  and 
autoradiography.  The  assays  were  carried  using  either  the  initial  extracts  (lanes  a)  or 
the  same  extracts  diluted  with  buffer  by  10-,  100-,  and  1,000-fold  (lanes,  b,  c,  and  d. 
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respectively).  The  IEF  positions  of  X-DHFRs  bearing  N-terminal  Asp  or  Glu  versus 
Asn  or  Gin  are  shown  on  the  left.  The  corresponding  pH  values  are  indicated  on  the 

right.  (B)  Immortalized  +/+  and  Ntanl~/~  EF  cells  (see  Materials  and  methods)  were 
transiently  transfected  with  the  plasmids  pRC/dhaUbXnsP4Pgal,  which  expressed 
DHFR-ha-UbR48-X-nsP4Pgal  test  proteins  (X=Met,  Asn,  Gin,  or  Arg)  (40).  These 
proteins  were  cotranslationally  cleaved  in  vivo  by  deubiquitylating  enzymes  (DUBs), 
yielding  the  long-lived  reference  protein  DHFR-ha-UbR48  and  a  test  protein 
X-nsP4Pgal  (X=Met,  Asn,  Gin,  or  Arg)  (see  the  main  text).  Cells  were  labeled  with 
35S-methionine/ cysteine,  followed  by  a  chase  for  0, 1,  and  2  hr  (as  indicated  at  the  top 
of  the  panel)  in  the  presence  of  cycloheximide,  preparation  of  extracts, 
immunoprecipitation,  SDS-PAGE,  autoradiography  and  quantitation,  essentially  as 
described  (40).  The  bands  of  X-nsP4pgal  proteins  and  the  DHFR-ha-UbR48  reference 
protein  are  indicated  on  the  right  as  X-Pgal  and  DHFR.  (C)  Same  as  B  but  with 

immortalized  Ntanl~ <  EF  cells.  (D)  Quantitation  of  the  in  vivo  degradation  of  X- 
nsP4Pgal  test  proteins  using  the  reference-based  pulse-chase  patterns  in  B  and  C  (see 
Materials  and  methods).  The  amounts  of  35S  in  an  X-nsP4Pgal  protein,  relative  to 
35S  in  the  DHFR-ha-UbR48  reference  protein  at  the  same  time  points,  were  plotted  as 
percentages  of  this  ratio  for  Met-nsP4Pgal  at  time  0.  (Met-nsP4pgal  bore  a  stabilizing 

N-terminal  residue.)  Open  and  closed  symbols:  wildtype  and  Ntanl "/-  EF  cells, 
respectively.  □,  ■:  Met-nsP4pgal;  O,  •:  Asn-nsP4pgal;  V,  ▼:  Arg-nsP4pgal;  A,  A: 
Gln-nsP4Pgal. 

FIGURE  5.  Normal  motor  coordination  and  reduced  spontaneous  activity  of 
Ntanl~/~  mice.  Open  symbols:  +/+mice.  Solid  symbols:  congenic  (same-strain 
background)  Ntanl'l ~  mice  (see  Materials  and  methods).  Statistically  significant 

differences  (p<0.05)  are  indicated  by  *.  24  Ntanl~/~  mice  and  their  congenic  +/+ 
littermates  were  used  in  the  experiments  of  A-C.  Non-littermates  were  used  in  the 
experiments  of  D-G.  (A)  The  rotarod  test.  Time  elapsed  before  the  animals  fell  from 
a  horizontal  rod  rotating  at  10  rpm  (squares)  or  20  rpm  (circles).  No  significant 

differences  between  +/  +  and  Ntanl~l~  mice.  (B)  Weight  retention  test.  Time  elapsed 
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before  the  animals  released  a  hook.  The  height  of  each  bar  represents  the  time 

averaged  from  360  trials.  No  significant  differences  between  +/+  and  Ntanl~/~  mice. 
(C)  Coat  hanger  test.  Squares:  time  elapsed  before  the  animal  grabbed  the  horizontal 
bar  with  both  rear  paws.  Circles:  time  elapsed  before  the  front  two  paws  reached  one 
of  the  side  bars.  Each  data  point  is  an  average  of  48  trials.  No  significant  differences 

between  +/+  and  Ntanl~/~  mice.  (D)  Shuttlebox  avoidance  test.  Number  of  null 
responses  per  50  daily  trials  in  which  the  mouse  remained  in  the  original 
compartment  of  shuttle  box  and  received  20  sec  of  electric  shock  during  the  original 
learning  and  retention  testing  7-10  weeks  later.  (E)  Passive  avoidance  test.  Time 
before  entering  dark  chamber  after  an  automated  guillotine  door  opened,  exposing 
the  dark  chamber.  On  Day  1  the  mice  were  shocked  after  entering  the  dark  chamber. 

13  Ntanl~/~  mice  and  12  congenic  +/  +  mice  were  used.  (Fa-Fd)  Open  field  test. 

(Fa)  The  total  distance  traveled  by  the  animal  during  the  observation  time  of  9  min. 
(Fb)  Time  (out  of  9  min  total)  that  the  animal  spent  in  the  center  area.  (Fc)  Time  (out 

of  9  min  total)  that  the  animal  spent  within  1  cm  of  the  walls.  28  Ntanl~/~  mice  and 
27  congenic  +/+  mice  were  used  in  this  set  of  tests. 

figure  6.  Socially  conditioned  differences  in  the  exploratory  behavior  between 
+/+  and  congenic  Ntanl~/~  mice.  Open  bars,  +/+  mice;  solid  bars,  congenic 

(same-strain  background)  Ntanl~/~  mice.  Platform-leaving  test:  one  Ntanl~l~  mouse 
and  one  +/+  mouse  (either  a  littermate  or  a  non-littermate)  were  placed,  at  the  same 
time,  on  the  16  x  22  x  2.7  cm  platform.  Mice  were  allowed  to  either  explore  or  step 
down  from  the  platform  until  the  cutoff  time  of  180  sec.  (Aa-Ac)  Pairs  of  littermates. 
The  time  by  which  the  animal  left  the  platform  (a);  the  number  of  animals,  of  each 
genotype,  leaving  first  (b);  and  the  number  of  animals,  of  each  genotype,  not  leaving 
by  the  cutoff  time  (c).  Four  independent  sets  of  tests  (at  least  10  pairs  of  mice  per  test) 

were  carried  out  over  6  months,  using  44  pairs  of  +/+  and  Ntanl~/~  littermates,  which 
were  produced  through  heterozygous  (+/-)  matings  and  identified  by  genotyping 
-700  mice.  The  results  differed  by  less  than  15%  among  the  independent  tests. 

(Ba-Bc)  Same  as  A  except  with  pairs  of  non-littermates.  24  +/  +  mice  and  24  Ntanl~/~ 
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mice  were  examined  in  this  test. 

FIGURE  7.  Spatial  learning  and  memory  in  Ntanl~/~  mice.  Open  symbols  or  bars 
represent  +/+  mice.  Solid  symbols  or  bars  represent  congenic  (same-strain 
background)  Ntanl' /"  mice.  See  the  main  text  and  Materials  and  methods  for  detail. 
10  +/+  mice  and  10  Ntanl'/'  mice  were  examined  in  the  tests  of  A-C.  (A)  Mean  daily 
errors  made  by  mice  for  each  phase  in  the  spatial  radial  arm  maze.  The  data  were 
separated  into  those  for  acquisition  phase  (sessions  2-7)  and  asymptotic  phase 
(sessions  8-12).  (B)  Same  as  A  except  that  the  mice  were  tested  in  the  non-spatial 
radial  arm  maze,  in  which  the  spatial  cues  were  replaced  by  non-spatial  cues. 

(C)  Effects  of  memory  load  in  the  water  version  of  the  spatial  radial-arm  maze. 

Shown  are  total  errors  per  trial.  (Da-Dc)  Performance  of  +/+  (open  circles;  total  of  29 

mice)  and  congenic  Ntanl ~/~  mice  (solid  circles;  total  of  30  mice)  in  the  Lashley  maze. 
The  ratio  of  the  number  of  correct  path  segments  taken  to  the  total  number  of 
segments  (learning  index)  (a);  errors  in  the  T-choices  (making  the  correct  left  or  right 
turn)  (b);  errors  made  by  moving  away  from  the  goal  (c). 
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Abstract 

Nascent  polypeptides  emerging  from  the  ribosome  and  not  yet  folded  may  at 
least  transiently  present  degradation  signals  similar  to  those  recognized  by  the 
ubiquitin  system  in  misfolded  proteins.  Thus,  the  folding  of  nascent  proteins, 
including  abnormal  ones,  may  be  in  kinetic  competition  with  pathways  that  target 
these  proteins  for  degradation  cotranslationally.  Here  we  describe  the  ubiquitin 
sandwich  technique  that  allows  the  detection  and  measurement  of  cotranslational 
protein  degradation  in  living  cells.  We  use  this  method  to  demonstrate  that  more 
than  50%  of  nascent  protein  molecules  bearing  an  N-terminal  degradation  signal 
can  be  degraded  cotranslationally,  never  reaching  their  mature  size  before  their 
destruction  by  processive  proteolysis. 
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Nascent  (being  synthesized)  polypeptides  emerging  from  the  ribosome  may, 
in  the  process  of  folding,  present  degradation  signals  similar  to  those  recognized  by 
the  ubiquitin  system  in  misfolded  or  otherwise  damaged  proteins  (2).  It  has  been  a 
long-standing  question  whether  a  significant  fraction  of  nascent  polypeptides  is 
cotranslationally  degraded  (2).  Determining  whether  nascent  polypeptides  are 
actually  degraded  in  vivo  has  been  difficult  because  at  any  given  time  the  nascent 
chains  of  a  particular  protein  species  are  of  different  sizes,  and  therefore  would  not 
form  a  band  upon  electrophoresis  in  a  conventional  pulse-chase  assay  (3).  The 
ubiquitin  (Ub)  sandwich  technique  makes  it  possible  to  detect  cotranslational 
protein  degradation  by  measuring  the  steady-state  ratio  of  two  reporter  proteins 
whose  relative  abundance  is  established  cotranslationally. 

The  polypeptide  to  be  examined  for  cotranslational  degradation,  termed  B,  is 
sandwiched  between  two  stable  reporter  domains  A  and  C  in  a  linear  fusion 
protein.  The  three  polypeptides  are  connected  via  Ub  moieties  to  create  a  fusion 
protein  of  the  form  AUb-BUb-CUb  (Fig.  1A).  Ub-specific  processing  proteases 
(UBPs)  (4)  cotranslationally  cleave  such  linear  Ub  fusions  after  the  last  residue  of 
Ub  (5-8).  The  independent  polypeptides  AUb,  BUb  and  CUb  that  result  from  the 
cleavage  of  AUb-BUb-CUb  are  called  modules  below. 

UBP-mediated  cleavage  establishes  a  kinetic  competition  between  two 
mutually  exclusive  events  during  the  synthesis  of  the  AUb-BUb-CUb  fusion: 
cotranslational  UBP  cleavage  at  the  BUb-CUb  junction  to  release  the  long-lived 
CUb  module  or,  alternatively,  cotranslational  degradation  of  the  entire  BUb-CUb 
nascent  chain  by  the  26S  proteasome  (9)  (Fig.  IB).  In  the  latter  case,  the  processivity 
of  proteasome-mediated  degradation  results  in  the  destruction  of  the  Ub  moiety 
between  B  and  C  before  it  can  be  recognized  by  UBPs.  The  resulting  drop  in  levels 
of  the  CUb  module  relative  to  levels  of  AUb,  referred  to  as  the  C/A  ratio,  reflects 
the  cotranslational  degradation  of  domain  B  (Fig.  IB).  This  measurement  provides 
a  minimal  estimate  of  the  total  amount  of  cotranslational  degradation,  because 
non-processive  cotranslational  degradation  events  that  do  not  extend  into  the  C 
domain  are  not  detected. 
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It  should  be  noted  that  the  Ub  moieties  of  the  fusion  serve  solely  as 
cotranslationally  cleavable  junctions  between  domains  A,  B,  and  C.  They  do  not 
target  the  attached  proteins  for  degradation  (8).  In  addition,  a  lysine  to  arginine 
substitution  at  position  48  of  these  moieties  (UbR48)  prevents  their  conjugation  to 
other  Ub  molecules  at  that  position,  so  they  cannot  be  involved  in  the  formation 
of  a  major  class  of  substrate-linked  multi-Ub  chains  required  for  degradation  by  the 
26S  proteasome  (20,  11). 

UBP-mediated  cleavage  was  previously  used  to  examine  the  in  vivo  kinetics 
of  protein  translocation  into  the  endoplasmic  reticulum  (ER)  (6),  where  it  was 
shown  that  cleavage  was  fast  enough  to  be  cotranslational.  We  directly  verified 
this  in  the  case  of  an  AUb-BUb-CUb  fusion,  using  an  in  vivo  radiolabeling 
regimen  in  which  the  labeling  pulse  was  significantly  shorter  than  the  time 
required  for  the  complete  synthesis  of  AUb-BUb-CUb,  and  terminating  the  pulse  by 
arresting  translation.  At  the  beginning  of  the  pulse,  the  cell's  ribosomes  are  at 
various  stages  in  the  synthesis  of  the  fusion.  As  a  result,  the  pulse  produces  a 
population  of  nascent  AUb-BUb-CUb  chains  that  are  labeled  at  various  regions 
along  the  chain  (Fig.  2B).  The  nascent  chains  that  are  just  starting  to  be  synthesized 
when  the  pulse  begins  will  incorporate  label  into  the  N-terminal  A  domain,  but 
they  will  not  be  elongated  to  full-length  chains  before  the  end  of  the  pulse.  Under 
these  conditions  free  labeled  AUb  could  be  produced  only  if  UBPs  cleave  the 
nascent  polypeptide  chains  (Fig.  2B). 

S.  cerevisiae  cells  expressing  a  190  kD  AUb-BUb-CUb  of  the  form 
{DHFRhaUb}  -  {MeKPgalUb}  -  {MeKDHFRhaUb}  (Fig.  2A)  were  labeled  with 
35S-methionine/ cysteine  for  45  sec  (12).  In  the  eukaryotic  cell  types  examined,  rates 
of  translation  vary  between  2-10  residues/sec,  while  a  rate  of  12  residues/sec  has 
been  measured  in  E.  coli  (13).  Assuming  a  moderate  estimate  of  5  residues/sec  in 
S.  cerevisiae,  the  above  fusion  would  require  -350  sec  for  complete  synthesis,  more 
than  7  times  the  duration  of  the  45  sec  pulse.  The  labeling  was  terminated  by  the 
addition  of  the  translation  inhibitor  cycloheximide,  and  UBPs  were 
simultaneously  inactivated  with  N-ethylmaleimide  (NEM)  (6).  Discrete  bands 
corresponding  to  the  labeled  AUb  (DHFRhaUb)  and  BUb  (MeKpgalUb)  modules 
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were  observed  upon  immunoprecipitation,  indicating  that  UBP-mediated  cleavage 
at  the  AUb-BUb  junction  was  cotranslational  (Fig.  2C).  Moreover,  no  full-length 
AUb-BUb-CUb  fusion  was  detected  (Fig.  2C),  indicating  that  the  cotranslational 
cleavage  by  UBPs  was  highly  efficient.  High  concentrations  of  NEM  in  these 
extracts  increased  nonspecific  immunoprecipitation,  yielding  a  crossreacting 
species  that  co-migrated  with  CUb  (MeKDHFRhaUb).  No  such  crossreacting  band 
was  present  when  NEM  was  omitted  (Fig.  3A).  However,  only  the  presence  of  free 
labeled  AUb  was  diagnostic  of  cotranslational  cleavage;  free  CUb  was  expected  to  be 
observed  whether  cleavage  by  UBPs  was  post-  or  cotranslational  (Fig.  2B). 

This  result  confirmed  the  key  assumption  of  the  Ub  sandwich  technique, 
validating  its  use  to  detect  cotranslational  degradation  in  vivo.  Previous  evidence 
bearing  on  cotranslational  degradation  was  based  on  experiments  with  inhibitors 
or  cell-free  systems  (14).  Nascent  polypeptide  chains  might  be  protected  from 
degradation  in  vivo,  either  because  they  are  sterically  shielded  by  chaperones  or 
because  their  translation  time  is  short  in  comparison  to  the  time  required  for 
targeting  by  the  degradation  machinery.  To  maximize  the  likelihood  of  detecting 
cotranslational  degradation,  a  large,  118  kD,  Pgal-derived  polypeptide  carrying  a 
strong  N-terminal  degradation  signal  was  initially  chosen  as  the  B  domain  of  the 
AUb-BUb-CUb  fusion  (Fig.  2A).  A  protein  of  this  size  is  unable  to  fit  into  the  cavity 
of  the  cytosolic  CCT  chaperonin  (15),  ensuring  that  this  type  of  shielding  was 
unavailable  to  the  test  protein.  In  addition,  the  N-terminal  location  of  the 
degradation  signal  made  it  potentially  accessible  from  the  beginning  of  translation. 

The  degradation  signal  used  was  an  N-degron,  which  is  targeted  by  Ubrlp, 
the  E3  component  of  the  N-end  rule  pathway  (5,  16).  The  Pgal-linked  N-degron 
comprises  the  destabilizing  N-terminal  residue  arginine  (R)  and  a  short, 
lysine-bearing  extension,  eK  (Fig.  2A)  that  is  the  site  of  multi-Ub  chain  attachment 
(22,  27).  ReK-Pgal  is  rapidly  degraded  in  vivo  (t1/2  ~2min)  (5,  7).  Changing  just  the 
N-terminal  residue  of  the  protein  to  methionine  (M)  inactivates  the  degradation 
signal  by  precluding  recognition  by  Ubrlp.  The  resulting  MeK-Pgal  is 
posttranslationally  stable  (t1/2  >  20  hr)  (5,  7).  AUb-BUb-CUb  fusions  in  which 
domain  B  was  either  ReK-Pgal  or  its  N-degron-lacking  counterpart  MeK-Pgal 
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(Fig.  2A)  were  expressed  in  S.  cerevisiae  strains  containing  different  levels  of  Ubrlp 
(18).  The  extent  of  cotranslational  degradation  was  assessed  by  radiolabeling  for 
30  min  and  immunoprecipitation  (19)  to  determine  the  levels  of  CUb  relative  to 
AUb;  this  parameter  is  referred  to  below  as  the  C /A  ratio. 

The  results  of  a  representative  experiment  are  shown  in  Fig.  3 A.  The  C/A 
ratio  was  significantly  and  reproducibly  lower  in  cells  expressing  the 
N-degron-bearing  domain  B,  but  only  in  those  strains  that  also  expressed  Ubrlp. 

To  determine  the  percentage  of  nascent  chains  cotranslationally  degraded  by  the 
N-end  rule  pathway,  we  averaged  the  C/A  ratios  over  several  trials,  and  compared 
the  ratios  for  ReK-Pgal  in  wildtype  (0.52)  and  Ubrlp-overexpressing  strains  (0.39)  to 

the  ratio  found  with  the  ubrlA  strain  (0.86)  (Fig.  4A).  This  comparison  indicated 
that  -40%  of  the  nascent  ReK-Pgal  chains  were  cotranslationally  degraded  in  the 
wildtype  strain  (Fig.  4B)  (20).  This  fraction  increased  to  ~55%  when  Ubrlp  was 
overexpressed  from  the  Pgali  promoter. 

A  different  way  to  measure  the  extent  of  N-degron-dependent 
cotranslational  degradation  was  to  compare  the  C/A  ratios  for  ReK-Pgal  and 
MeK-Pgal  in  the  same  strain.  This  comparison  independently  indicated  -40% 
cotranslational  degradation  of  ReK-figal  in  the  wildtype  strain  (R  =  0.52;  M  =  0.82), 
and  -55%  when  Ubrlp  was  overexpressed  (R  =  0.39;  M  =  0.87).  Note  that  the  C/A 

ratios  for  ReK-Pgal  in  the  ubrlA  strain,  and  for  MeK-Pgal  in  all  the  strains  used, 
were  -0.85  (Fig.  4A),  significantly  less  than  the  1.0  ratio  expected  in  the  absence  of 
cotranslational  degradation;  this  is  discussed  below. 

The  extent  of  cotranslational  degradation  was  expected  to  depend  on  the 
length  of  the  B  domain  (Fig.  lB)^  because  a  larger  protein  requires  a  longer  time  for 
synthesis,  and  possibly  also  because  the  nascent  chain  of  a  larger  protein  extends  a 
greater  distance  from  the  ribosome.  To  address  this  issue,  we  determined  the 
extent  of  cotranslational  degradation  of  two  other  B  domains.  The  mammalian 
Sindbis  virus  RNA  polymerase,  termed  nsP4,  is  a  69  kD  protein  that  naturally  bears 
an  N-degron  (21).  XeK-Ura3p  (X=M  or  R)  is  a  34  kD  enzyme  of  the  S.  cerevisiae 
uracil  biosynthetic  pathway  that  either  carried  (ReK-Ura3p)  or  lacked  (MeK-Ura3p) 
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an  N-degron  (22)  (Fig.  2A).  Initially,  immunoblot  analysis  was  used  to  estimate  the 
steady-state  C/A  ratios  for  these  AUb-BUb-CUb  fusions  (Fig.  3B)  (23).  While  the 
short-lived  R-nsP4  (69  kD)  and  ReK-pgal  (118  kD)  were  cotranslationally  degraded 
to  similar  extents,  ReK-Ura3p  (34  kD),  which  was  also  short-lived 
posttranslationally  (22)  exhibited  much  less  cotranslational  degradation  than  the 
other  two  proteins  (Fig.  3B).  To  measure  C/A  ratios  more  accurately,  in  vivo 
radiolabeling  and  immunoprecipitations  were  carried  out.  The  results  of  several 
experiments,  summarized  in  Figure  4,  clearly  showed  that  in  the  presence  of  a 
strong  N-degron,  cotranslational  degradation  is  significant  for  all  three  of  the  tested 
proteins,  ReK-pgal,  R-nsP4,  and  ReK-Ura3p. 

Comparisons  of  the  C/A  ratios  for  R-nsP4  in  the  wildtype  ( UBR1 )  and  ubrlA 
strains  indicated  that  -50%  of  the  nascent  chains  were  cotranslationally  degraded 
by  the  N-end  rule  pathway  in  wildtype  cells,  and  -55%  in  cells  overexpressing 
Ubrlp  (Fig.  4B).  These  values  were  similar  to,  respectively,  -40%  and  -55% 
cotranslational  degradation  of  ReK-Pgal  in  these  strains  (Fig.  4B).  In  contrast,  only 
-20%  cotranslational  degradation  was  observed  with  the  34  kD  ReK-Ura3p  in  either 
wildtype  or  Ubrl-overexpressing  strains  (Fig.  4B),  suggesting  that  smaller  proteins 
are  less  susceptible  to  cotranslational  degradation. 

These  comparisons  established  the  levels  of  cotranslational  degradation  by 
the  Ubrlp-dependent  N-end  rule  pathway.  In  addition,  the  C/A  ratios  for  two  of 
the  N-degron-lacking  B  domains,  MeK-Pgal  and  M-nsP4,  were  less  than  1.0, 
suggesting  that  they  might  be  cotranslationally  degraded  by  a  Ubrlp-independent 
pathway.  To  determine  the  baseline  C/A  ratio  for  negligible  cotranslational 
degradation,  we  constructed  an  AUb-BUb-CUb  fusion  in  which  domain  B  was  an 
18-residue  sequence  containing  the  FLAG  epitope  (Fig.  2A).  This  sequence  bore  no 
known  degradation  signals  and  would  presumably  be  too  short  to  be  degraded 
cotranslationally.  The  C/A  ratio  observed  was  0.97.  Thus  MeK-Pgal  exhibited  a 
-15%  drop  in  C/A  relative  to  the  FLAG-bearing  peptide,  M-nsP4  a  -25%  drop, 
whereas  the  C/A  ratios  obtained  with  MeK-Ura3  and  FLAG  were  indistinguishable 
(Fig.  4A).  One  interpretation  of  these  differences  is  that,  although  MeK-Pgal  and 
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M-nsP4  are  posttranslationally  stable,  they  may  be  cotranslationally  degraded  to  a 
significant  extent.  Premature  termination  of  translation  and/or  transcription  may 
also  contribute  to  the  drop  in  the  C/A  ratios  for  these  two  N-degron-lacking 
B  domains.  It  is  unlikely  that  premature  termination  of  translation  entirely 
accounts  for  the  observed  difference  between  the  C/A  ratios  for  FLAG  and  M-nsP4, 
because  the  codon  adaptation  index  of  the  M-nsp4  open  reading  frame  (0.1)  is 
higher  than  the  one  for  MeK-Pgal  (0.07),  but  the  drop  in  C/A  ratio  is  greater  with 
M-nsP4  (25%)  than  with  MeK-pgal  (15%). 

This  work  demonstrates  that  a  nascent  polypeptide  carrying  a  degradation 
signal  can  be  cotranslationally  degraded  in  vivo  (Fig.  4B).  Cotranslational  protein 
degradation,  which  cannot  be  detected  by  conventional  pulse-chase  assays,  can  now 
be  studied  using  the  Ub  sandwich  technique  (Fig.  1).  This  method  revealed  that 
the  extent  of  cotranslational  degradation  can  be  strikingly  high:  in  the  case  of 
ReK-pgal  in  Ubrlp-overexpressing  cells,  over  50%  of  nascent  polypeptide  chains 
never  reach  their  full  size  before  their  destruction  by  processive  proteolysis.  Thus, 
if  a  nascent  chain  displays  a  degron  of  the  Ub  system,  such  a  protein  becomes  a 
target  of  kinetic  competition  between  cotranslational  biogenesis  and 
cotranslational  degradation.  Since  the  folding  of  a  protein  molecule  begins  during 
its  synthesis  on  the  ribosome,  a  nascent  polypeptide  may  cotranslationally  expose 
degradation  signals  in  the  form  of,  for  example,  hydrophobic  patches  which 
become  shielded  through  the  folding  of  the  newly  formed  protein  (2,  24).  Our 
findings  with  the  posttranslationally  long-lived  MeK-|3gal  and  M-nsP4  (Fig.  4A)  are 
consistent  with  this  possibility. 

If  cotranslational  protein  degradation  by  the  Ub  system  is  found  to  be 
extensive  in  vivo,  it  could  be  accounted  for  as  an  evolutionary  trade-off  between 
the  necessity  of  identifying  and  destroying  degron-bearing  mature  proteins  and  the 
mechanistic  difficulty  of  distinguishing  between  posttranslationally  and 
cotranslationally  presented  degrons.  Conditional  degradation  signals  that  require 
posttranslational  phosphorylation  for  activation,  such  as  those  in  IxBa  and 
(3-catenin  (25),  may  have  the  additional  feature  of  not  contributing  to 
cotranslational  proteolysis.  Cotranslational  protein  degradation  may  also  represent 
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a  previously  unrecognized  form  of  protein  quality  control,  which  destroys  nascent 
chains  that  fail  to  fold  correctly.  These  and  other  questions  about  physiological 
aspects  of  cotranslational  protein  degradation  can  now  be  addressed  directly  in 
living  cells  through  the  Ub  sandwich  technique. 
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Figure  Legends 

figure  1-  The  ubiquitin  sandwich  technique.  (A)  Organization  of  a  Ub 
sandwich  fusion.  B,  the  polypeptide  assayed  for  cotranslational  degradation,  is 
sandwiched  between  two  stable  reporter  domains  A  and  C.  Red  arrows  indicate  the 
locations  of  UBP  cleavage  sites.  (B)  The  principle  of  the  method.  The  reporter 
module  AUb  is  the  first  to  be  synthesized,  and  is  cotranslationally  released  from  B, 
thereby  providing  a  measure  of  the  number  of  nascent  B  chains  that  initially 
emerge  from  the  ribosome.  If  the  processive  degradation  of  the  emerging  B, 
indicated  by  its  insertion  into  the  cylindrical  proteasome,  is  strictly 
posttranslational,  the  UBP-mediated  cleavage  at  the  BUb-C  junction  releases  CUb 
before  B  is  degraded,  so  the  molar  yields  of  CUb  and  AUb  are  identical.  However,  if 
the  degradation  of  B  can  be  cotranslational,  a  significant  fraction  of  BUb-CUb  may 
be  degraded  as  a  unit.  This  will  result  in  the  molar  yield  of  CUb  being  lower  than 
AUb,  the  difference  being  a  measure  of  cotranslational  degradation. 

figure  2.  UBP-mediated  cleavage  of  ubiquitin  sandwich  fusions  is 
cotranslational.  (A)  The  protein  fusions  used  (29).  Domains  A  and  C  are  mouse 
dihydrofolate  reductase  tagged  with  the  influenza  haemagglutinin-derived  ha 
epitope  (DHFRha).  Domain  C  carries  an  N-terminal  extension  (eK,  see  text)  which 
makes  it  electrophoretically  distinguishable  from  A.  The  different  B  domains  are 
E.  coli  (3-galactosidase  ((3gal),  Sindbis  virus  RNA  polymerase  (nsP4)  and 
S.  cerevisiae  Ura3.  Unstable  (N-degron-bearing)  versions  of  these  domains  have 
an  N-terminal  arginine  (R)  residue;  stable  versions  have  methionine  (M)  (B)  The 
population  of  nascent  chains  produced  by  a  radiolabeling  pulse  significantly 
shorter  than  the  time  of  translation  of  an  AUb-BUb-CUb  fusion.  Stretches  of  the 
polypeptide  containing  radiolabel  are  in  red;  unlabeled  stretches  are  in  black.  If 
UBPs  efficiently  cleave  the  nascent  chain,  free  radiolabeled  AUb,  BUb  and  CUb 
should  all  be  detected.  If  UBPs  can  cleave  solely  the  full-length,  mature  protein, 
only  the  labeled  CUb  module  will  be  observed.  (C)  The  UBP  cleavage  of  Ub 
sandwich  fusions  is  cotranslational.  S.  cerevisiae  expressing  an  AUb-BUb-CUb 
fusion  of  the  form  {DHFRhaUb}-{MeK|3galUb}-{MeKDHFRhaUb}  which  is 
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predicted  to  take  -350  sec  to  be  synthesized,  were  subjected  to  a  45-sec 
35S-methionine  pulse.  Labeling  was  terminated  by  arresting  translation  with 
cycloheximide,  and  UBPs  were  simultaneously  inactivated  with  NEM  (22).  The 
release  of  modules  AUb  (DHFRhaUb),  BUb  (MeKPgalUb)  and  CUb 
(MeKDHFRhaUb)  by  UBP  cleavage  was  assayed  by  immunoprecipitation.  CUb* 
denotes  both  the  MeKDHFRhaUb  band  and  a  crossreacting  band  present  in  these 
NEM-treated  extracts  but  not  in  untreated  ones  (compare  with  Fig.  3A). 

Figure  3.  A  fraction  of  nascent  polypeptides  bearing  an  N-terminal 
degradation  signal  is  degraded  cotranslationally.  (A)  Determination  of  C/A  ratios 
through  immunoprecipitation  of  in  yzbo-labeled  AUb-BUb-CUb  fusion  proteins 
(19).  Two  variants  of  |3gal  as  domain  B  were  used,  one  that  carried  an  N-degron 
(ReK(3gal,  t1/2  -2  min)  and  the  otherwise  identical  pgal  that  differed  by  one  residue 
and  lacked  this  degradation  signal  (MeKpgal,  t1/2  >  20  hr).  The  two  pgal  variants 
were  expressed  in  S.  cerevisiae  strains  containing  different  levels  of  Ubrlp,  the 
rate-limiting  recognition  component  of  the  N-end  rule  pathway.  Each  pair  of  lanes 
corresponds  to  two  independent  experiments.  (B)  Immunoblot  analysis  of  the 
steady-state  C/A  levels  for  different  B  domains  (described  in  Fig.  2A)  (23).  The 
identity  of  the  N-terminal  residue  of  each  of  the  B  domains,  R  (Arg)  or  M  (Met),  is 
indicated  above  each  lane.  F  marks  the  lane  corresponding  to  an  AUb-BUb-CUb 
fusion  protein  in  which  domain  B  was  the  18-residue  FLAG-containing  moiety. 

Figure  4.  The  extent  of  cotranslational  protein  degradation  depends  on  the 
presence  of  a  degron,  the  activity  of  a  degron-specific  proteolytic  pathway,  and  the 
nature  and  size  of  the  protein.  (A)  C/A  ratios  obtained  for  the  different  B  domains. 
(B)  Ubrlp-dependent  cotranslational  degradation.  These  values  were  obtained  by 
comparing  C/A  ratios  observed  for  a  particular  N-degron-bearing  B  domain  in 
wildtype  or  Ubrlp-overexpressing  strains  to  the  ratio  determined  in  the  ubrlA 
strain  (20).  Each  bar  represents  a  mean  value  derived  from  at  least  4  independent 
experiments;  standard  errors  are  indicated.  Differences  between  means  were 
significant  to  at  least  p  <  0.05  by  the  Mann- Whitney  test  (30). 
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The  N-degrons,  a  set  of  degradation  signals  recognized 
by  the  N-end  rule  pathway,  comprise  a  protein’s  desta¬ 
bilizing  N-terminal  residue  and  an  internal  lysine 
residue.  We  show  that  the  strength  of  an  N-degron  can 
be  markedly  increased,  without  loss  of  specificity, 
through  the  addition  of  lysine  residues.  A  nearly 
exhaustive  screen  was  carried  out  for  N-degrons  in  the 
lysine  (K)-asparagine  (N)  sequence  space  of  the 
14-residue  peptides  containing  either  K  or  N  (16  384 
different  sequences).  Of  these  sequences,  68  were  found 
to  function  as  N-degrons,  and  three  of  them  were  at 
least  as  active  and  specific  as  any  of  the  previously 
known  N-degrons.  All  68  K/N-based  N-degrons  lacked 
the  lysine  at  position  2,  and  all  three  of  the  strongest 
N-degrons  contained  lysines  at  positions  3  and  15.  The 
results  support  a  model  of  the  targeting  mechanism  in 
which  the  binding  of  the  E3-E2  complex  to  the  sub¬ 
strate’s  destabilizing  N-terminal  residue  is  followed  by 
a  stochastic  search  for  a  sterically  suitable  lysine 
residue.  Our  strategy  of  screening  a  small  library  that 
encompasses  the  entire  sequence  space  of  two  amino 
acids  should  be  of  use  in  many  settings,  including 
studies  of  protein  targeting  and  folding. 

Keywords :  N-degron/N-end  rule/proteolysis/simple 
sequences/ubiquitin 


Introduction 

Regulatory  proteins  are  often  short-lived  in  vivo ,  providing 
a  way  to  generate  their  spatial  gradients  and  to  rapidly 
adjust  their  concentration  or  subunit  composition  through 
changes  in  the  rate  of  their  synthesis  or  degradation. 
Most  of  the  damaged  or  otherwise  abnormal  proteins  are 
metabolically  unstable  as  well.  Many  other  proteins,  while 
long-lived  as  components  of  larger  stmctures  such  as 
ribosomes  and  oligomeric  proteins,  are  short-lived  as  free 
subunits  (reviewed  by  Hochstrasser,  1996;  Varshavsky, 
1997;  Hershko  and  Ciechanover,  1998;  Scheffner  et  al , 
1998;  Koepp  et  al ,  1999;  Tyers  and  Willems,  1999). 

Features  of  proteins  that  confer  metabolic  instability 
are  called  degradation  signals,  or  degrons  (Laney  and 
Hochstrasser,  1999).  One  class  of  degradation  signals, 
called  the  N-degrons,  comprises  a  protein’s  destabilizing 


N-terminal  residue  and  an  internal  Lys  residue  (Bachmair 
et  al ,  1986;  Varshavsky,  1996).  A  set  of  N-degrons 
containing  different  N-terminal  residues  that  are  destabiliz¬ 
ing  in  a  given  cell  defines  a  rule,  termed  the  N-end  rule, 
which  relates  the  in  vivo  half-life  of  a  protein  to  the 
identity  of  its  N-terminal  residue.  The  lysine  determinant 
of  an  N-degron  is  the  site  of  formation  of  a  substrate- 
linked  multi-ubiquitin  chain  (Bachmair  and  Varshavsky, 
1989;  Chau  et  al ,  1989).  The  N-end  rule  pathway  is  thus 
one  pathway  of  the  ubiquitin  (Ub)  system.  Ub  is  a 
76-residue  protein  whose  covalent  conjugation  to  other 
proteins  plays  a  role  in  a  multitude  of  processes,  including 
cell  growth,  division,  differentiation,  and  responses  to 
stress  (Pickart,  1997;  Varshavsky,  1997;  Peters,  1998; 
Scheffner  et  al ,  1998).  In  many  of  these  settings,  Ub  acts 
through  routes  that  involve  the  degradation  of  Ub-protein 
conjugates  by  the  26S  proteasome,  an  ATP-dependent 
multisubunit  protease  (Coux  et  al ,  1996;  Hilt  and  Wolf, 
1996;  Baumeister  et  al ,  1998;  Rechsteiner,  1998). 

The  N-end  rule  is  organized  hierarchically.  In  the 
yeast  Saccharomyces  cerevisiae ,  Asn  and  Gin  are  tertiary 
destabilizing  N-terminal  residues  in  that  they  function 
through  their  conversion,  by  the  NTA 1 -encoded  N-terminal 
amidase,  into  the  secondary  destabilizing  N-terminal  res¬ 
idues  Asp  and  Glu  (Baker  and  Varshavsky,  1995).  The 
destabilizing  activity  of  N-terminal  Asp  and  Glu  requires 
their  conjugation,  by  the  ATE1 -encoded  Arg-tRNA-protein 
transferase,  to  Arg,  one  of  the  primary  destabilizing 
residues.  The  primary  N-terminal  residues  are  bound 
directly  by  the  UBR1 -encoded  N-recognin,  the  E3  (recog¬ 
nition)  component  of  the  N-end  rule  pathway.  In  Sxerevis - 
iae,  N-recognin  is  a  225  kDa  protein  that  binds  to  potential 
N-end  rule  substrates  through  their  primary  destabilizing 
N-terminal  residues:  Phe,  Leu,  Trp,  Tyr,  lie,  Arg,  Lys  and 
His  (Varshavsky,  1996).  Analogous  components  of  the 
mammalian  N-end  rule  pathway  have  been  identified  as 
well  (Stewart  et  al ,  1995;  Grigoryev  et  al ,  1996;  Kwon 
et  al ,  1998,  1999). 

Studies  with  engineered  N-end  rule  substrates  indicated 
the  bipartite  organization  of  N-degrons  and  suggested  a 
stochastic  model  of  their  targeting,  in  which  specific 
lysines  of  an  N-end  rule  substrate  could  be  assigned 
different  probabilities  of  being  used  as  a  ubiquitylation 
site  (Bachmair  and  Varshavsky,  1989;  Chau  et  al ,  1989; 
Johnson  et  al ,  1990;  Hill  et  al ,  1993;  Varshavsky,  1996; 
Levy  et  al ,  1999).  Most  of  the  evidence  for  this  model  was 
produced  with  a  set  of  N-degrons  in  which  a  destabilizing 
N-terminal  residue  X  was  linked  to  the  ~40-residue 
Escherichia  coli  Lac  repressor-derived  sequence  termed 
eK  [extension  (e)  bearing  lysines  (K)]  (Figure  1  A)  (Bachm¬ 
air  and  Varshavsky,  1989).  The  resulting  X-eK  sequence 
comprised  a  portable  N-degron,  which  could  confer  short 
half-lives  on  test  proteins  such  as  E.coli  (3-galactosidase 
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Fig.  1.  Test  proteins.  (A)  Fusions  used  in  this  work  contained  some  of 
the  following  elements  (see  Materials  and  methods):  DHFRha,  a 
mouse  dihydrofolate  reductase  moiety  extended  at  the  C-terminus  by  a 
sequence  containing  the  hemagglutinin-derived  ha  epitope;  the  UbR48 
moiety  bearing  the  Lys— >Arg  alteration  at  position  48;  a  40-residue 
E.coli  Lac  repressor-derived  sequence,  termed  eK  and  shown  in  single¬ 
letter  abbreviations  for  amino  acids;  a  variable  residue  X  (either  Tyr, 
His  or  Met)  between  UbR48  and  eK;  the  E.coli  pgal  moiety  lacking  the 
first  24  residues  of  wild-type  pgal.  The  positions  of  single  or  multiple 
KRK  insertions  into  eK  are  indicated.  The  endogenous  KRK  sequence 
of  eK  is  underlined.  The  arrow  indicates  the  site  of  in  vivo  cleavage  by 
DUBs.  (B)  The  fusion  construct  used  for  screening  in  the  K/N 
sequence  space.  The  14  residues  of  eK  immediately  following  the 
residue  X  were  replaced  by  a  set  of  14-residue  sequences  that 
comprised  a  random  permutation  of  Lys  and  Asn  residues,  followed  by 
the  sequence  HGSGAWLLPVSLVRS,  derived  from  residues  2-14  of 
the  eK  extension,  followed  by  Arg-Ser. 


(Pgal)  or  mouse  dihydrofolate  reductase  (DHFR)  (Varshav¬ 
sky,  1996).  At  least  one  of  two  lysines  (K)  in  eK,  either 
K-15  or  K-17,  must  be  present  for  the  N-degron  to  be 
active  (Figure  1A)  (Bachmair  and  Varshavsky,  1989; 
Johnson  et  al ,  1990).  Even  though  several  other  classes 
of  N-degron,  including  the  naturally  occurring  ones,  have 
been  described  over  the  last  decade  (Townsend  et  al , 
1988;  deGroot  et  al ,  1991;  Dohmen  et  al ,  1994;  Sadis 
and  Finley,  1995;  Ghislain  et  al ,  1996;  Sijts  et  al ,  1997; 
Tobery  and  Siliciano,  1999),  the  mechanistic  understanding 
of  these  degradation  signals  remains  confined  largely  to  the 
eK-based  N-degrons  (Varshavsky,  1996;  Levy  et  al,  1999). 

In  the  present  work,  we  show  that  spiking  an  eK-based 
N-degron  with  additional  Lys  residues  can  markedly 
increase  its  activity.  We  also  show,  using  a  new  approach 
of  searching  in  the  sequence  space  of  lysine  and  asparagine, 
that  simple-sequence  N-degrons  can  be  as  strong  and 
specific  as  any  of  the  previously  known  N-degrons.  These 
findings  provide  independent  evidence  for  the  model  of  a 
bipartite  N-degron  and  stochastic  targeting  mechanism 
(Bachmair  and  Varshavsky,  1989).  The  strategy  of  exhaust¬ 
ive  searching  in  the  sequence  space  of  two  amino  acids 
should  be  of  use  in  many  settings,  including  studies  of 
protein  folding  and  degradation. 


Results  and  discussion 

The  Ub/ protein /reference  technique 

The  assays  below  utilized  the  previously  developed  Ub/ 
protein/reference  (UPR)  technique,  which  increases  the 


accuracy  of  pulse-chase  analysis  by  providing  a  ‘built-ifi’ 
reference  protein  (Levy  et  al,  1996).  This  method  employs 
a  linear  fusion  in  which  Ub  is  located  between  a  protein 
of  interest  and  a  reference  protein  moiety  (Figure  1A). 
The  fusion  is  co-translationally  cleaved  by  Ub-specific 
de-ubiquitylating  enzymes  (DUBs)  (Wilkinson  and 
Hochstrasser,  1998)  after  the  last  residue  of  Ub,  producing 
equimolar  amounts  of  the  protein  of  interest  and  the 
reference  protein  bearing  the  C-terminal  Ub  moiety.  If 
both  the  reference  protein  and  the  protein  of  interest  are 
immunoprecipitated  in  a  pulse-chase  assay,  the  relative 
amounts  of  the  protein  of  interest  can  be  normalized 
against  the  reference  protein  in  the  same  sample.  The 
UPR  technique  can  thus  compensate  for  the  scatter  of 
immunoprecipitation  yields,  sample  volumes  and  other 
sources  of  sample-to-sample  variation  (Levy  et  al , 
1996,  1999). 

Two  previously  introduced  terms,  ID*  initial  decay,  i.e. 
the  decay  of  a  protein  during  the  pulse  of  x  min,  and 
t  o7i ,  the  protein’s  half-life  averaged  over  the  interval  of  y 
to  z  min  of  chase  (Levy  et  al ,  1996),  are  used  below  to 
describe  the  decay  curves  of  test  proteins.  The  ID*  term 
and  the  interval-specific  term  t$ “f  would  be  superfluous 
in  the  case  of  a  strictly  first-order  decay,  which  is  defined 
by  a  single  half-life.  However,  the  in  vivo  degradation 
of  most  proteins  deviates  from  first-order  kinetics.  For 
example,  the  rate  of  degradation  of  short-lived  proteins 
can  be  much  higher  during  the  pulse,  in  part  because  a 
newly  labeled  (either  nascent  or  just-completed)  polypep¬ 
tide  is  conformationally  immature  and  may,  consequently, 
be  targeted  for  degradation  more  efficiently  than  its  mature 
counterpart.  This  enhanced  early  degradation,  previously 
termed  the  ‘zero-point’  effect  (Baker  and  Varshavsky, 
1991),  is  described  by  the  parameter  ID*  (Levy  et  al , 
1996).  It  was  found  that  a  large  fraction  of  the  zero-point 
effect  results  from  the  co-translational  degradation  of 
nascent  (being  synthesized)  polypeptide  chains,  which 
never  reach  their  mature  size  before  their  destruction 
by  processive  proteolysis  (G.Turner  and  A.Varshavsky, 
unpublished  data).  The  detection  of  a  zero-point  effect 
requires  the  comparison  of  a  test  protein’s  degradation 
between  cells  containing  and  lacking  the  relevant  proteo¬ 
lytic  pathway.  Alternatively,  the  zero-point  effect  can  be 
detected  by  comparing,  through  the  UPR  technique,  the 
degradation  of  otherwise  identical  degron-containing  and 
degron-lacking  versions  of  a  test  protein  (Levy  et  al , 
1996,  1999).  Although  the  degradation  of  a  protein  during 
the  pulse  can  be  strikingly  high  (Levy  et  al ,  1996) 
(see  also  below),  it  is  not  detectable  by  a  conventional, 
reference-lacking  pulse-chase  assay. 

Increasing  the  strength  of  N-degrons  by  spiking 
them  with  additional  lysines 

The  UPR  constructs  of  the  present  work  were  DHFR-ha- 
UbR48-X-eK-(Jgal  fusions.  They  contained  the  meta- 
bolically  stable,  ha-epitope-bearing  DHFR-ha-UbR48 
moiety  as  a  reference  protein,  termed  dha-Ub  below.  The 
dha-Ub-X-eK-pgal  proteins  were  co-translationally  cleaved 
in  vivo ,  yielding  the  test  protein  X-eK-Pgal  and  the 
reference  dha-Ub  (Figure  1A).  To  reduce  the  possibility 
that  the  C-terminal  Ub  moiety  of  dha-Ub  could  function 
as  a  ubiquitylation/degradation  signal,  the  K-48  residue 
of  Ub  (a  major  site  of  isopeptide  bonds  in  multi-Ub 
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chains)  was  converted  to  Arg,  which  cannot  be  ubi- 
quitylated,  yielding  UbR48  (Levy  et  al ,  1996).  These  and 
related  fusions  (Figure  1A)  were  expressed  in  S.cerevisiae 
from  low  copy  plasmids  and  the  copper-inducible  P Cupi 
promoter. 

X-eK~Pgal  is  an  extensively  analyzed  class  of  N-end 
rule  substrates,  which  contain  a  variable  N-terminal  residue 
X  (produced  through  the  DUB-mediated  cleavage  of 
dha-Ub-X-eK-pgal  at  the  Ub-X  junction),  a  40-residue 
N-terminal  extension  called  eK  (see  Introduction),  and  a 
Pgal  moiety  lacking  the  first  24  residues  of  wild-type 
E.coli  Pgal  (Figure  1A).  If  K-15  and  K-17,  the  only 
lysines  of  the  eK  extension  (Figure  1A),  are  replaced  by 
Arg  residues,  which  cannot  be  ubiquitylated,  the  resulting 
X-eAK-Pgal  is  long-lived  even  if  its  N-terminal  residue  is 
destabilizing  in  the  N-end  rule  (Bachmair  et  al .,  1986; 
Johnson  et  al ,  1990).  The  inactivity  of  N-degron  in  X- 
eAK_pgai  js  caused  by  the  absence  of  targetable  lysines 
(Varshavsky,  1996).  Specifically,  the  multiple  lysines  of 
the  pgal  moiety  in  X-eAK-pgal  (Chau  et  al. ,  1989)  cannot 
serve  as  N-degron  determinants,  apparently  because  the 
most  N-terminal  Lys  residue  in  X-eAK~Pgal,  at  position 
239,  is  too  far  from  the  protein’s  N-terminus. 

One  of  our  aims  was  to  produce  stronger  N-degrons. 
We  chose  Tyr,  a  moderately  destabilizing  type  2  residue 
(Bachmair  and  Varshavsky,  1989;  Varshavsky,  1996),  as 
the  N-terminal  residue  of  an  initial  test  protein  (Figure  1  A). 
More  strongly  destabilizing  N-terminal  residues,  e.g.  Leu 
or  Arg,  in  the  context  of  (expected)  stronger  N-degrons 
would  have  made  the  test  proteins  too  short-lived  for 
detection  in  a  pulse-chase  assay.  Met  was  employed  as  a 
stabilizing  N-terminal  residue.  The  term  ID5  below  (see 
Materials  and  methods)  conveys  the  extent  of  degradation 
of  a  protein  during  the  5  min  pulse,  in  comparison  with 
the  degradation,  during  the  same  pulse,  of  a  control 
(degron-lacking,  i.e.  Met-bearing)  version  of  the  same 
protein. 

To  determine  whether  the  degradation  of  Tyr-eK-Pgal 
in  S.cerevisiae  could  be  enhanced  through  the  addition  of 
Lys  residues  while  remaining  dependent  on  the  Ubrlp 
N-recognin,  the  sequence  Lys-Arg-Lys  (KRK),  identical 
to  the  sequence  at  positions  15-17  of  eK,  was  inserted 
at  the  indicated  locations  within  eK  (Figure  1A).  The 
unmodified  Tyr-eK-pgal  had  an  ID5  of  -48%,  i.e.  -48% 
of  the  labeled  Tyr-eK-Pgal  was  destroyed  during  the  5  min 
pulse,  before  time  0.  The  t^°  (half-life  between  0  and 
10  min  of  chase)  of  Tyr-eK~Pgal  was  -26  min  (Figures  2A 
and  3A).  The  KRK  sequence  inserted  at  any  of  the 
indicated  three  positions  within  eK  (Figure  1A)  strongly 
destabilized  the  already  short-lived  Tyr-eK-pgal:  for 
example,  Tyr-KleK-Pgal  (Figure  1A)  had  an  ID5  of  -75% 
and  to310  °f  min  (Figures  2A  and  3A).  The  increased 
degradation  of  Tyr-eK-pgal  derivatives  containing  extra 
KRK  remained  completely  Ubrlp-dependent:  Tyr-eK-Pgal, 
Tyr-KleK-Pgal,  Tyr-K2eK-Pgal  and  Tyr-K3eK-pgal  were 
all  long-lived  proteins  (t05  >10  h)  in  ubrlA  cells 
(Figures  2A  and  3A).  In  addition,  Met-eK-pgal,  Met-KleK- 
Pgal,  Met-K2eK-Pgal  and  Met-K3eK-pgal,  the  Met-bearing 
counterparts  of  the  Tyr-based  N-end  rule  substrates,  were 
long-lived  in  either  UBR1  or  ubrlA  cells  (data  not  shown). 

These  results  led  us  to  examine  the  effects  of  adding 
more  than  one  KRK  sequence  to  eK  (Figure  1A).  The 
resulting  Tyr-eK~Pgal  derivatives,  bearing  either  two 
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Fig.  2.  Active  N-degrons  can  be  strongly  enhanced  by  additional 
lysines.  (A)  Congenic  UBR1  fwt)  and  ubrlA  S.cerevisiae  that 
expressed  the  UPR-based  fusions  Met-eK-Pgal  (Mek)  (DHFR-ha- 
UbR48-Met-eK-pgal),  Tyr-eK-pgal  (Yek),  Tyr-KleK-pgal  (YK1), 

Tyr- K2eK- pgal  (YK2)  and  Tyr-K3eK-Pgal  (YK3)  (see  Figure  1A)  were 
labeled  with  [35S]methionine/cysteine  for  5  min  at  30°C,  followed  by 
a  chase  for  0,  10  and  30  min,  extraction,  immunoprecipitation  with 
anti-ha  and  anti-pgal  antibodies,  SDS-PAGE,  and  autoradiography  (see 
Materials  and  methods).  The  bands  of  X-pgal  (test  protein)  and 
DHFR-ha-UbR48  (reference  protein)  are  indicated  on  the  right.  (B)  As 
in  (A),  but  with  Met-eK-pgal  (Mek),  Tyr-eK-pgal  (Yek),  Tyr-KleK-pgal 
(YK1),  Tyr-KKleK-pgal  (YKK1)  and  Tyr-KKKleK-pgal  (YKKK1) 

(see  Figure  1A).  (C)  As  in  (A),  but  with  Met-eK-pgal  (Mek),  His- 
eK-pgal  (Hek),  His-KleK-pgal  (HK1),  His-KKleK-pgal  (HKK1)  and 
His-KKKleK-pgal  (HKKK1).  (D)  Metabolic  stability  of 
conformationally  mature  Tyr-eK-Pgal  and  its  KRK-spiked  derivatives. 
JD54  ( Pgali-UBRI )  cells  expressing  Tyr-eK-pgal  (Yek),  Tyr-KleK- 
Pgal  (YK1)  or  Tyr-KKleK-pgal  (YKK1)  were  grown  in  SM-raffinose 
medium  (no  expression  of  Ubrlp),  then  labeled  with  [35S] methionine/ 
cysteine  for  10  min  at  30°C.  After  a  20  min  chase  in  SM-raffinose, 
galactose  was  added  to  3%  to  induce  Ubrlp  expression,  followed  by  a 
chase  for  1,  3  and  6  h,  and  the  analysis  of  immunoprecipitated  test 
proteins.  Lanes  1,  6  and  11,  the  end  of  35S  labeling  (time  0).  Lanes  2, 
7  and  12,  the  end  of  20  min  chase  in  SM-raffinose.  Lanes  3,  8  and  13, 
1  h  chase  with  galactose.  Lanes  4,  9  and  14,  3  h  chase.  Lanes  5,  10 
and  15,  6  h  chase. 


(Tyr-KKleK-pgal)  or  three  (Tyr-KKKleK-(3gal)  KRK 
sequences,  in  addition  to  the  original  KRK  of  eK,  were 
extremely  short-lived  proteins,  even  though  N-terminal 
Tyr  is  a  weakly  destabilizing  residue  (Varshavsky,  1996). 
For  example,  Tyr-KKleK-|5gal  (Figure  1A)  had  ffe10  of 
-4  min  (in  comparison  with  -26  min  in  the  case  of  Tyr- 
eK-Pgal)  and  an  ID5  of  -94%.  In  other  words,  -94%  of 
the  labeled  Tyr-KKleK-(3gal  was  destroyed  during  the 
5  min  pulse,  before  time  0  (Figures  2B  and  3B).  At  the 
same  time,  all  of  these  proteins  were  long-lived  in  ubrlA 
cells  (Figures  2B  and  3B). 

The  N-terminal  Tyr  is  bound  by  the  type  2  site  of 
N-recognin  (Ubrlp)  that  recognizes  substrates  bearing 
bulky  hydrophobic  N-terminal  residues  (Varshavsky, 
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Fig.  3.  Quantitation  of  degradation  of  the  test  proteins.  (A)  Pulse- 
chase  patterns  of  Met-eK-pgal  (MeK),  Tyr-eK-pgal  (YeK),  Tyr-KleK- 
pgal  (YK1),  Tyr-K2eK-pgal  (YK2)  and  Tyr-K3eK-pgal  (YK3)  (see 
Figures  1A  and  2 A)  were  quantitated  using  the  UPR  technique  and 
Phosphorlmager  (see  Materials  and  methods).  Time  0  refers  to  the  end 
of  the  5  min  pulse;  100%  refers  to  the  relative  amount  of  Met-eK-pgal, 
normalized  against  the  reference  protein  dha-UbR48.  (B)  As  in  (A)  but 
with  Met-eK-pgal  (MeK),  Tyr-eK-pgal  (YeK),  Tyr-KleK-pgal  (YK1), 
Tyr-KKleK-Pgal  (YKK1)  and  Tyr-KKKleK-Pgal  (YKKK1)  (see 
Figures  1A  and  2B).  (C)  As  in  (A)  but  with  Met-eK-Pgal  (MeK),  His- 
eK-pgal  (HeK),  His-KleK-pgal  (HK1),  His-KKleK-|3gal  (HKK1)  and 
His-KKKleK-pgal  (HKKK1)  (see  Figures  1A  and  2C).  (D)  As  in  (A), 
but  quantitation  of  the  post-translational  degradation  of  Tyr-eK-pgal 
(YeK),  Tyr-KleK-pgal  (YK1)  and  Tyr-KKleK-pgal  (YKK1)  in  the 
P gali~UBR1  strain  JD54  (see  Figures  1A  and  2 A),  following  the 
induction  of  Ubrlp  by  galactose.  Time  0  refers  to  the  end  of  the 
10  min  labeling  in  raffinose  (no  Ubrlp).  The  cells  were  incubated  for 
another  20  min  in  raffinose,  followed  by  the  addition  of  galactose  to 
induce  Ubrlp  (see  Materials  and  methods).  (E)  Relative  enzymatic 
activities  of  Pgal  in  UBR1  cells  (filled  bars)  and  ubrlA  cells  (striped 
bars)  expressing  one  of  the  following  test  proteins:  Met-eK~pgal 
(MeK),  Tyr-eK-Pgal  (YeK),  Tyr-KleK-Pgal  (YK1),  Tyr-KKleK-Pgal 
(YKK1),  Tyr-KKK  1  ueK-pgal  (YKKK1),  His-eK-pgaI  (HeK),  His-KleK- 
Pgal  (HK1),  His-KK  1  eK-Pgal  (HKK1)  and  His-KKK3eK-Pgal 
(HKKK1)  (see  Materials  and  methods).  The  activities  of  Pgal  were 
normalized  to  the  activity  of  Met-eK-Pgal  in  each  cell.  Values  shown 
are  the  means  from  duplicate  measurements,  which  yielded  results 
within  10%  of  the  mean  values. 


1996).  We  asked  whether  the  above  findings  were  also 
relevant  to  the  type  1  (basic)  destabilizing  N-terminal 
residues  Arg,  Lys  and  His,  which  are  bound  by  the  type  1 
site  of  Ubrlp.  Counterparts  of  the  Tyr-pgal  fusions  that 
bore  N-terminal  His  (a  weak  type  1  destabilizing  residue) 
were  constructed  (Figure  1A)  and  tested  in  pulse-chase 
assays.  The  His  residue  was  chosen  as  a  type  1  destabilizing 
residue  in  these  tests  for  the  same  reason  as  Tyr  in  the 
preceding  tests:  a  stronger  destabilizing  residue  would 
have  made  the  measurements  impractical  with  extremely 
short-lived  substrates.  The  results  (Figures  2C  and  3C) 
confirmed  the  generality  and  specificity  of  degradation 
enhancement  by  the  additional  KRK  sequences.  For 
example,  His-KleK-pgal  had  an  ID5  of  -95%  (i.e.  -95% 
of  the  labeled  His-KleK-|3gal  was  destroyed  during  the 
5  min  pulse,  before  time  0),  in  comparison  with  the  ID5 
of  -41%  for  His-eK-Pgal;  the  corresponding  t^0  values 
were  -5  min  and  -8  min  for  His-KleK-pgal  and  His-eK- 
Pgal,  respectively  (Figure  3C). 

Similar  to  the  results  with  Tyr-bearing  substrates,  their 
His-bearing,  multiple  KRK-containing  counterparts  were 
long-lived  in  ubrlA  cells  (Figures  2C  and  3C).  The 
only  exception  was  His-KKKleK-Pgal  (Figure  1A),  which 
contained  three  KRK  sequences,  in  addition  to  the  KRK 
of  the  original  eK:  in  contrast  to  Tyr-KKK leK- pgal,  His- 
KKKleK-pgal  was  stabilized  strongly  but  incompletely  in 
the  ubrlA  genetic  background  (Figure  3C).  Thus,  the 
His-KKKleK  extension,  in  contrast  to  the  Tyr-KKKleK 
extension  (Figure  1A),  appears  to  contain  a  Ubrlp-inde- 
pendent  degron. 

Previous  work  (Madura  et  al,  1993;  Kwon  et  al ,  1999) 
has  shown  that  the  steady-state  level  of  an  X-Pgal  protein 
(determined  by  measuring  the  enzymatic  activity  of  Pgal 
in  yeast  extracts)  is  a  sensitive  measure  of  its  metabolic 
stability.  The  results  of  this  steady-state  assay  were  in 
agreement  with  those  derived  from  pulse-chase  measure¬ 
ments:  the  level  of  Tyr-eK-Pgal  in  UBR1  cells  was  53% 
of  the  level  of  the  long-lived  Met-eK-pgal,  whereas  Tyr- 
KleK-pgal  was  present  at  4%  of  the  Met-eK-pgal  level, 
and  the  concentration  of  Tyr-KKK  leK- Pgal  was  virtually 
indistinguishable  from  the  assay’s  background  (cells  trans¬ 
formed  with  vector  alone)  (Figure  3E).  Crucially,  the 
levels  of  these  extra  KRK-bearing  Tyr-eK~Pgal  fusions 
in  ubrlA  cells  became  similar  to  that  of  Met-eK-pgal 
(Figure  3E),  in  agreement  with  the  pulse-chase  data 
(Figures  2A,  B  and  3A,  B). 

Although  the  addition  of  extra  KRK  sequences  to 
eK  yielded  considerable  decreases  in  the  ?o7510  of  the 
corresponding  N-end  rule  substrates,  by  far  the  major 
effect  of  multiple  KRK  sequences  was  on  the  decay 
curves’  ID5  term,  which  conveys  the  extent  of  degradation 
of  a  protein  during  or  shortly  after  its  synthesis 
(Figures  2A-C  and  3A-C).  To  examine  this  issue  in  a 
different  way,  Tyr-eK-pgal,  Tyr-KleK-Pgal  and  Tyr- 
KKleK-Pgal  (Figure  1A)  were  produced  in  the  JD54 
Sxerevisiae  strain,  which  expressed  Ubrlp  from  the  galac¬ 
tose-inducible,  dextrose-repressible  VGAL1  promoter.  JD54 
cells  expressing  one  of  the  test  proteins  were  labeled  in 
raffinose-containing  SR  medium  (no  Ubrlp),  incubated 
for  20  min  in  the  same  medium  and  thereafter  shifted  to 
galactose,  where  Ubrlp  was  induced.  Even  though  the 
N-end  rule  pathway  became  hyperactive  in  the  presence 
of  galactose  (Madura  and  Varshavsky,  1994;  Ghislain 
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et  al. ,  1996;  data  not  shown),  the  pre-labeled  substrates 
Tyr-eK-|3gal  and  Tyr-KleK-pgal  were  barely  degraded  after 
the  induction  of  Ubrlp;  Tyr-KKleK-pgal  was  degraded 
only  slightly  (Figures  2D  and  3D). 

These  findings  were  consistent  with  the  earlier 
evidence  for  a  strong  retardation  of  the  post-translational 
degradation  of  Arg-eK-Pgal  under  the  same  conditions 
(R J.Dohmen  and  A.Varshavsky,  unpublished  data).  Thus, 
in  contrast  to  a  newly  formed,  conformationally  immature 
Pgal-based  test  protein,  a  conformationally  mature  pgal 
tetramer  is  a  poor  substrate  of  the  N-end  rule  pathway 
even  in  the  presence  of  N-degron  enhancements  such 
as  the  additional  KRK  sequences.  It  is  the  pgal  moiety 
of  these  test  proteins  (Figure  1A)  that  was  responsible 
for  the  time-dependent  decline  in  the  rate  of  degradation, 
because  the  kinetics  of  in  vivo  degradation  of  eK-DHFR- 
based  N-end  rule  substrates  was  much  closer  to  first- 
order  decay  (Levy  et  al. ,  1999;  data  not  shown). 

Locating  N-degrons  in  the  lysine-asparagine 
sequence  space 

The  earlier  work,  which  led  to  the  bipartite  model  of 
N-degron  (Bachmair  and  Varshavsky,  1989;  Hill  et  al. , 
1993),  and  particularly  the  present  findings  about  the 
effects  of  adding  KRK  sequences  to  an  eK-based  N-degron 
(Figures  2  and  3)  suggested  that  a  substrate’s  destabilizing 
N-terminal  residue  and  a  sterically  suitable  internal  Lys 
residue  (or  residues)  are  the  two  necessary  and  sufficient 
components  of  an  N-degron.  However,  since  both  the 
eK-based  and  other  previously  analyzed  N-degrons  are 
embedded  in  complex  sequence  contexts  (deGroot  et  al, 
1991;  Dohmen  et  al.,  1994;  Varshavsky,  1996),  we  wished 
to  address  the  bipartite-degron  model  by  constructing  an 
N-degron  from  much  simpler  sequence  motifs.  Should 
this  prove  feasible,  we  also  wanted  to  explore  constraints 
on  the  structure  of  N-degrons  through  a  screen  in  a  simpler 
sequence  setting.  If  the  sequence  space  could  be  reduced 
strongly  enough,  one  advantage  of  such  a  screen  would 
be  its  exhaustiveness.  The  AAA  codon  for  lysine  differs 
by  just  one  third-letter  substitution  from  the  codon  for 
asparagine  (AAU),  a  polar  uncharged  residue.  Thus,  one 
could  attempt  a  screen  for  N-degrons  in  the  sequence 
space  of  two  amino  acids:  Lys  (K)  and  Asn  (N). 

A  double-stranded  oligonucleotide  that  encoded  random 
14-residue  K/N  sequences  (see  Materials  and  methods) 
was  used  to  replace  the  sequence  encoding  14  residues  of 
eK  immediately  following  the  residue  X  (Figure  IB).  In 
the  resulting  test  proteins,  this  latter  sequence,  HGSG- 
AWLLPVSLVRS  (plus  the  sequence  RS),  followed  the 
quasi-random  14-residue  K/N  sequence  (Figure  1A).  The 
resulting  K/N-based  extensions  either  lacked  the  lysines 
or  contained  a  variable  number  of  them  between  residues 
2  and  16.  The  K-17  of  eK  (the  only  other  lysine  in  eK) 
was  replaced  by  Arg.  In  these  test  fusions,  dha-UbR48- 
Arg-(K/N)i4-eA-pgals,  Arg  was  used  as  a  destabilizing 
N-terminal  residue  (Figure  IB).  The  number  of  different 
14-residue  sequences  containing  exclusively  K  or  N  is 
214  =  1 6  384.  The  bulk  of  a  library  of  this  complexity  could 
be  encompassed  with  conventional  screening  methods. 
Testing  of  the  pRKN14-based  library  by  amplifying  it  in 
E.coli  indicated  that  >90%  of  the  plasmids  contained  an 
oligonucleotide  insert.  The  pRKN14  library  was  intro¬ 
duced  into  S.cerevisiae  JD54  (Ghislain  et  al,  1996),  which 


expressed  Ubrlp  from  the  P GAU  promoter,  and  screened 
for  colonies  that  stained  blue  with  XGal  [high  levels  of 
Arg-(K/N)14-eA-(3gal]  on  dextrose  (SD)  plates  but  stained 
white  [low  levels  of  Arg-(K/N)14-eA-Pgal]  on  replica- 
plated  galactose  (SG)  plates.  Approximately  20  000  colon¬ 
ies  were  screened  this  way.  A  total  of  68  isolates  were 
identified  in  which  the  activity  of  Pgal  was  significantly 
higher  in  the  absence  than  in  the  presence  of  the  N-end 
rule  pathway. 

The  corresponding  Arg-(K/N)14-eA-pgal  test  proteins 
were  expressed  in  congenic  ubrlA  and  UBR1  strains,  and 
the  ratio  of  Pgal  activities  was  determined  for  each  of  the 
test  proteins.  The  results  are  summarized  in  Figure  4, 
which  shows  the  K/N  sequences  of  the  30  most  active 
N-degrons,  and  the  ratios  of  the  corresponding  pgal 
activity  in  the  ubrlA  strain  to  that  in  the  UBR1  strain 
(higher  ratios  indicate  stronger  N-degrons).  Remarkably, 
the  strongest  K/N-based  N-degron  was  found  to  be  more 
active  than  the  strongest  eK-based  N-degron  (Figure  4). 
Black  bars  in  Figure  4  denote  Pgal  activity  derived  from 
constructs  carrying  K/N-based  N-degrons  with  lysines 
present  at  positions  3  and  15;  the  strongest  N-degrons 
were  largely  of  this  class  (Figure  4).  K-15  was  present  in 
the  15  strongest  K/N-based  N-degrons  except  one  (clone 
132),  which  had  K  at  position  14,  and  was  also,  presumably 
in  compensation  for  the  absence  of  K-15,  one  of  the  most 
lysine-rich  N-degrons  in  this  set  (Figure  4).  Similarly,  K-3 
was  present  in  the  15  strongest  K/N-based  N-degrons 
except  three  (clones  3,  77  and  138)  (Figure  4).  All  of 
these  exceptional  clones  bore  K-15;  in  addition,  one  of 
them  (clone  77)  bore  K-4  and  K-5,  as  well  as  K-14  and 
K-15  (Figure  4). 

A  completely  uniform  feature  of  all  68  K/N-based 
N-degrons  was  the  absence  of  K  from  position  2  (Figure  4; 
data  not  shown),  consistent  with  the  fact  that  all  of  the 
previously  examined  N-degrons  (Varshavsky,  1996)  also 
lacked  a  strongly  basic  residue  (Arg  or  Lys)  at  position  2. 
To  address  this  issue  directly,  we  used  site-directed  muta¬ 
genesis,  replacing  asparagine  at  position  2  of  clone  119 
(the  strongest  K/N-based  N-degron;  Figure  4)  with  lysine. 
The  resulting  test  protein  was  long-lived  in  UBR1  cells 
(data  not  shown),  confirming  that  lysine  is  not  tolerated 
at  position  2  of  an  N-degron. 

Even  though  all  of  the  strongest  K/N-based  N-degrons 
contained  at  least  three  lysines,  the  total  number  of  lysines 
in  a  K/N-based  N-degron  was  not  a  strong  predictor  of 
its  activity,  and  the  arrangement  of  additional  lysines 
between  positions  3  and  15  did  not  correlate,  in  an  obvious 
way,  with  the  activity  of  N-degrons  (Figure  4).  None  of 
the  15  strongest  K/N-based  N-degrons  had  more  than  six 
lysines  (most  had  from  three  to  five)  (Figure  4),  indicating 
that  the  density  of  lysines  per  se  is  not  the  main  feature 
of  a  strong  K/N-based  N-degron.  Note  that  our  screen 
rejected  K/N-based  degrons  that  exhibited  a  significant 
Ubrlp-independent  activity.  This  may  in  part  account  for 
the  upper  limit  on  the  number  of  lysines  in  the  strongest 
N-degrons:  more  lysine-rich  K/N  sequences  could  contain 
motifs  recognized  by  non-N-end  rule  pathways,  similarly 
to  the  above  His-KKKl-pgal  test  protein  (Figures  1A,  2C 
and  3C). 

The  in  vivo  degradation  of  Arg-(K/N)14-eA-Pgal  proteins 
bearing  K/N-based  N-degrons  [produced  from  the  dha- 
UbR48-Arg-(K/N)]4-eA-Pgal  fusions  (Figure  IB)]  was  corn- 
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Residue  position 


pgal  activity  ratio  (  ubrIA  cells lUBRI  cells) 


Fig.  4.  N-degrons  in  the  K/N  sequence  space.  The  deduced  sequences  of  the  identified  K/N  N-degrons  are  shown  in  conjunction  with  the  bar 
diagram  of  their  relative  activity,  defined  as  the  ratio  of  pgal  activities  in  the  ubrIA  versus  UBR1  cells  expressing  a  given  Arg- (K/N)  14-eA- Pgal  test 
protein.  The  top  bar  (vertical  stripes)  indicates  the  relative  activity  of  N-degron  in  the  original  Arg-eK-pgal  (ReK).  Thirty  14-residue  K/N  extensions 
with  the  highest  Ubrlp-dependent  destabilizing  activity  are  listed,  out  of  the  total  of  68  isolates  that  were  metabolically  unstable  in  the  presence  but 
not  in  the  absence  of  Ubrlp.  The  total  number  of  possible  14-residue  K/N  sequences  is  16  384  (see  the  main  text).  The  Lys  and  Asn  residues  are 
denoted  as  the  letter  K  and  a  hyphen,  respectively.  Position  1  in  each  clone  was  occupied  by  Arg.  The  extensions  in  which  lysines  were  present 
either  at  positions  3  and  15,  or  only  at  15,  or  only  at  3,  are  marked,  respectively,  by  the  filled,  striped  and  open  bars. 


pared  with  the  degradation  of  Arg-eK-pgal  in  pulse-chase 
assays  (Figure  5A  and  B).  The  chosen  K/N  degrons 
(clones  119,  113  and  4)  were  the  two  most  active  ones  in 
the  |3gal  assay  (clones  119  and  113)  (Figure  4),  and  a 
relatively  weak  one  (clone  4)  (Figure  6A).  As  expected 
from  the  results  of  steady-state  Pgal  assays  (Figures  4), 
Arg-(K/N)14-eA-pgal  proteins  119  and  113  were  extremely 
short-lived  in  UBRl  cells  (Figure  5A),  in  contrast  to  their 
stability  in  ubrIA  cells  (Figure  5B).  As  with  Arg-eK-Pgal, 
the  bulk  of  degradation  of  Arg-(K/N)14-eA-pgals  took 
place  either  during  or  shortly  after  their  synthesis,  so  that 
even  at  time  0  (at  the  end  of  the  5  min  pulse)  the  test 
proteins  could  be  detected  only  by  overexposing  the 
autoradiograms  (Figure  5  A;  data  not  shown;  compare  with 
Figure  5B).  In  contrast,  Arg- (K/N)  14-eA- pgal  bearing  the 
weaker  N-degron  of  clone  4  was  readily  detectable  at  time 
0,  and  decayed  more  slowly  afterwards  (Figure  5A),  in 
agreement  with  the  results  of  steady-state  Pgal  assays 
(Figure  6B).  The  distribution  of  three  lysines  in  the 
strongest  N-degron  (clone  119:  Lys3,  8,  15)  (Figure  4) 
was  similar  to  that  in  a  relatively  weak  one  (clone  4: 
Lys4,  12,  14)  (Figure  6),  emphasizing  the  importance  of 
lysines  at  positions  3  and  15. 

To  address  in  more  detail  the  relative  contributions  of 
the  lysines  at  positions  3  and  15  to  the  activity  of  a 
K/N-based  N-degron,  site-directed  mutagenesis  was  used 
to  construct  the  otherwise  identical  N-degrons  that  con¬ 
tained  either  exclusively  Lys3  and  Lys  15  (no  lysine  at  a 
third,  interior  position),  exclusively  Lys3  or  exclusively 
Lys  15  (Figure  6A).  The  resulting  Arg-(K/N)14-eA-Pgal 


proteins  were  examined  using  both  steady-state 
(Figure  6B)  and  pulse-chase  assays  (Figure  5C  and  D). 
The  K/N-based  N-degron  that  contained  only  Lys3  and 
Lys  15  was  active  but  considerably  less  so  than  the  strongest 
K/N-based  N-degron  of  clone  119,  which  contained  Lys8 
as  well  (Figures  4,  5C  and  6).  The  elimination  of  either 
Lys3  or  Lys  15  further  weakened  the  N-degron,  so  that  the 
Ly$3  and  Lys  15  versions  of  Arg- (K/N)  14-eA- Pgal  were, 
respectively,  slightly  and  moderately  short-lived  proteins 
(Figures  5C,  D  and  7). 

Mechanistic  implications 

One  finding  of  this  work  is  that  an  N-degron  can  be 
greatly  strengthened  by  spiking  it  with  additional  lysine 
residues.  The  resulting  degradation  signals  remained 
specific:  nearly  all  of  the  enhanced  N-degrons  were 
completely  inactive  in  ubrIA  cells,  which  lacked  the  E3 
(recognition)  component  of  the  N-end  rule  pathway.  In 
addition,  the  enhanced  N-degrons  could  be  inactivated  by 
replacing  their  destabilizing  N- terminal  residue  with  Met, 
a  stabilizing  residue  in  the  N-end  rule. 

The  strength  of  a  K/N-based  N-degron  depends  on  the 
arrangement  of  lysines  in  the  14-residue  N-terminal  region 
of  the  test  protein.  This  dependence  is  both  strong  and 
complex  (Figures  4  and  6).  The  patterns  observed  may 
reflect  distinct  conformational  flexibilities  of  different 
K/N  sequences  vis-a-vis  the  relatively  fixed  spatial  arrange¬ 
ment  of  the  type  1  or  2  sites  of  Ubrlp  (N-recognin)  and 
its  associated  Ubc2p  E2  enzyme  (Madura  et  al. ,  1993). 
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Fig.  5.  Pulse-chase  analysis  of  K/N-based  N-degrons.  (A)  JD47-13C 
( UBRI )  S.cerevisiae  that  expressed  the  UPR-based  fusions  Met-eK- 
pgal  (Mek)  (DHFR-ha-UbR48-Met-eK-|3gal),  R-eK-Pgal  (Rek),  clone 
113  of  Arg-(K/N),4-eA-pgal  (KN113),  clone  119  of  Arg-(K/N)14-eA- 
pgal  (KN119),  clone  4  of  Arg-(K/N)14-eA-pgal  (KN4)  and  other  clones 
(see  Figures  IB  and  4)  were  labeled  with  [35S]methionine/cysteine  for 
5  min  at  30°C,  followed  by  a  chase  for  0,  10  and  30  min,  extraction, 
immunoprccipitation  with  anti-ha  and  anti-pgal  antibodies,  SDS-PAGE 
and  autoradiography  (see  Materials  and  methods).  The  bands  of 
X-pgal  (test  protein)  and  DHFR-ha-UbR48  (reference  protein)  are 
indicated  on  the  right.  (B)  As  in  (A),  but  with  the  congenic  JD55 
(ubriA)  cells.  (C)  As  in  (A)  but  with  the  clones  K3,  K15  and  K3K15 
(see  Figure  6).  (D)  As  in  (C)  but  with  JD55  (ubrJ A)  cells. 


Some  of  the  conclusions  indicated  by  our  data  are 
described  below. 

(i)  A  completely  uniform  feature  of  all  68  K/N-based 
N-degrons  was  the  absence  of  Lys  from  position  2 
(Figure  4;  data  not  shown),  consistent  with  the  fact  that 
all  of  the  previously  examined  N-degrons  (Varshavsky, 
1996)  also  lacked  a  strongly  basic  residue  (Arg  or  Lys) 
at  position  2.  Using  site-directed  mutagenesis  with  clone 
119  (Figure  4),  we  confirmed  that  lysine  is  not  tolerated 
at  position  2  of  an  N-degron  (data  not  shown). 

(ii)  The  strongest  K/N-based  N-degrons  contained  single 
Lys  residues,  surrounded  by  Asn  residues  (Figure  4),  in 
contrast  to  the  Lys-Arg-Lys  (KRK)  motif  present  in  the 
eK  extension  of  N-degrons  studied  previously  (Figure  1A). 
We  conclude  that  a  lysine-containing  motif  of  three 
adjacent  basic  residues  is  not  an  essential  feature  of  an 
N-degron. 

(iii)  The  presence  of  Lys  15  in  all  of  the  most  active 
K/N-based  N-degrons  (Figure  4)  is  consistent  with  the 
earlier  model  (Bachmair  and  Varshavsky,  1989;  Varshav¬ 
sky,  1996)  in  which  a  targetable  Lys  residue  should 
be  sufficiently  far  along  the  chain  from  a  destabilizing 
N-terminal  residue  to  allow  the  formation  of  a  loop  that 
positions  this  lysine  spatially  close  to  the  N-terminal 
residue.  This  model  is  also  consistent  with  the  finding 
that  Lys3  is  another,  nearly  invariant,  component  of  a 
K/N-based  N-degron.  Specifically,  Lys3  may  be  located 
at  the  uniquely  favorable  ‘linear’  distance  from  the 
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Fig.  6.  Site-directed  mutagenesis  of  N-degrons  identified  through  the 
screen  in  the  lysine-asparagine  sequence  space.  (A)  K/N  sequences  of 
clones  119,  113  and  4  and  the  K3/K15  derivatives  of  clone  119  (K3, 
K15  and  K3K15).  The  relative  activity  of  these  N-degrons,  defined  as 
the  ratio  of  pgal  activities  in  the  ubriA  versus  UBRI  cells  expressing 
a  given  Arg-(K/N)14-eA-pgal  protein,  is  indicated  on  the  right.  (B)  The 
levels  of  pgal  activity  in  extracts  from  UBRI  cells  (filled  bars)  and 
ubriA  cells  (striped  bars)  cells  expressing  the  indicated  test  proteins. 
Standard  deviations  (for  triplicate  measurements)  are  indicated. 


wt  ubriA 


Fig.  7.  Decay  curves  of  K3  and/or  K15  mutants  of  Arg-(K/N))4-eA- 
pgal  in  UBRI  and  ubriA  S.cerevisiae.  Pulse-chase  patterns  of  Met-eK- 
pgal  (MeK),  Arg-eK-pgal  (ReK),  and  Lys3  (K3),  Lys  15  (K15)  and 
Lys3/Lysl5  (K3K15)  variants  of  Arg-(K/N)14-eA-pgal  in  Figure  5C 
and  D  were  quantitated  as  described  in  the  legend  to  Figure  3  and 
Materials  and  methods. 


N-terminal  residue,  allowing  its  proximity  to  the  destabiliz¬ 
ing  N-terminal  residue  in  the  absence  of  loop  formation. 
That  this  view  is  at  best  incomplete  is  indicated  by  the 
finding  that  either  Lys3  alone  or  Lys  15  alone  cannot 
substitute  for  the  combination  of  Lys3,  Lys8  and  Lys  15 
that  defines  the  strongest  K/N-based  N-degron  identified 
to  date  (clone  119  in  Figures  4  and  6).  Moreover,  even 
Lys3  and  Lys  15  together,  in  the  absence  of  Lys8,  result 
in  a  much  weaker  N-degron  that  the  three-lysine  N-degron 
of  clone  119  (Figure  6).  One  possibility  is  that  Lysl5  is 
the  only  lysine,  among  the  three,  that  can  function  as  the 
site  of  ubiquitylation,  and  that  lysines  at  positions  3  and 
8  are  required  largely  for  optimal  conformational  flexibility 
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of  the  14-residue  K/N  region  in  this  N-degron.  It  is 
unlikely  that  the  much  higher  activity  of  the  clone  119 
(three-lysine)  N-degron  resulted  simply  from  its  higher 
positive  charge,  in  comparison  with  the  two-lysine 
N-degrons,  because  most  of  the  other  K/N  sequences 
containing  three  or  more  lysines  were  much  less  active 
than  the  clone  119  N-degron  (Figure  4). 

We  searched  for  N-degrons  in  a  K/N  region  14  residues 
long  (Figures  IB  and  4),  in  part  because  the  search  with 
a  significantly  longer  K/N  region  would  have  precluded 
the  screen  from  being  exhaustive.  Nevertheless,  since  the 
eK  extension  of  the  earlier  N-degrons  is  40  residues  long 
(Figure  1A),  and  since  a  part  of  the  eK  extension  was 
retained,  downstream  of  the  K/N  region,  in  our  test 
proteins  (see  Materials  and  methods),  it  remains  to  be 
determined  whether  a  K/N-based  N-degron  could  be  made 
even  stronger  by  replacing  the  entire  eK  with  a  relevant 
K/N  motif.  For  example,  one  could  take  advantage  of  the 
already  identified  14-residue  K/N  sequence  of  clone  119 
(Figure  4),  and  carry  out  an  analogous  screen  for  N-degrons 
in  which  the  K/N  sequence  of  clone  119  is  fixed,  while 
the  downstream,  eK-derived  sequence  is  replaced  by  ran¬ 
dom  K/N  motifs.  In  sum,  the  K/N  strategy  of  this  work 
is  far  from  exhausted  by  the  present  screen,  and  can  be 
used,  for  example,  to  produce  even  stronger  N-degrons, 
to  search  for  other  K/N-based  degradation  signals  and  to 
probe  the  targeting  mechanisms  of  the  Ub  system. 

Concluding  remarks 

One  implication  of  our  results  is  the  possibility  of  con¬ 
structing  much  stronger  N-degrons.  These  portable 
degradation  signals  can  be  used  to  render  proteins  of 
interest  that  are  short-lived  in  either  conditional  or  uncondi¬ 
tional  settings  (Dohmen  et  al,  1994;  Worley  etal,  1998). 

Another  implication  of  our  results  stems  from  the 
demonstrated  feasibility  of  defining  a  specific  class  of 
degradation  signals  in  a  sequence  space  of  just  two  amino 
acids.  Since  the  N-degrons  have  been  defined  previously 
in  much  more  complex  sequence  contexts  (Varshavsky, 
1996),  our  findings  suggest  that  other  classes  of  degrada¬ 
tion  signal,  and  some  of  the  other  targeting  signals  as 
well,  could  also  be  identified  and  examined  in  this  simple- 
sequence  setting.  Low-complexity  sequences  eliminate 
some  of  the  informational  ‘noise’  of  natural  sequences 
and  thereby  help  to  define  the  major  determinants  of 
structural  specificity.  The  advantages  of  the  simple- 
sequence  approach  have  long  been  recognized  by  research¬ 
ers  who  study  the  fundamentals  of  protein  folding  (Clarke, 
1995).  To  our  knowledge,  the  present  work  is  the  first  to 
extend  the  simple-sequence  approach  to  the  realm  of 
protein  degradation. 

Materials  and  methods 

Strains ,  media  and  genetic  techniques 

Saccharomyces  cerevisiae  strains  used  in  this  work  were  JD47-13C 
(MA7a  ura3-52  lys2-801  trpl-A63  his3-A200  leu2-3  112)  (Madura 
et  al ,  1993),  JD55  (MAT*  ura3-52  lys2-801  trpl-A63  his3-A200  leu2-3 
112  ubrl-Al::HIS3)  (Madura  and  Varshavsky,  1994)  and  JD54  (MATa 
ura3-52  lys2-801  trpl-A63  his3-A200  leu2-3 112  GAL1  ::UBR1)  (Ghislain 
et  al,  1996).  Rich  (YPD)  medium  contained  1%  yeast  extract,  2% 
peptone  (Difco)  and  2%  glucose.  Synthetic  media  (Ausubel  et  al,  1996) 
contained  either  2%  dextrose  (glucose)  (SD  medium),  3%  raffinose  (SR 
medium)  or  3%  galactose  (SG  medium).  To  induce  the  P Cupi  promoter, 


CuS04  was  added  to  a  final  concentration  of  0.2  mM.  Cells  were 
incubated  and  assayed  at  30°C.  Transformation  of  S.  cerevisiae  faas 
carried  out  using  the  lithium  acetate  method  (Ausubel  et  al,  1996). 

Plasmid  construction 

The  plasmids  encoded  Ub  fusions  of  the  UPR  technique  (Levy  et  al, 
1996)]  (see  Results  and  discussion).  The  reference  protein  was  mouse 
DHFR  fused,  through  a  20-residue  spacer  containing  the  ha  epitope,  to 
the  UbR48  moiety  bearing  Arg  instead  of  wild-type  Lys  at  position  48 
(Levy  et  al,  1996).  The  plasmids  pDhaUbXeKpgal  expressed,  from  the 
P cupi  promoter,  the  fusions  DHFR-ha-UbR48-X-eK-pgal,  where  the 
junctional  residue  X  was  either  Met,  Tyr  or  His;  eK  was  a  previously 
described  40-residue,  E.coli  Lac  repressor-derived  N-terminal  extension 
(Bachmair  and  Varshavsky,  1989;  Johnson  et  al,  1992);  the  E.coli  pgal 
moiety  lacked  the  first  24  residues  of  wild-type  pgal  (Bachmair  et  al, 
1986).  The  pgal-coding  fragment  was  produced  by  PCR  amplification 
of  the  pgal  open  reading  frame  (ORF)  in  pUB23  (Bachmair  et  al. ,  1986; 
Bachmair  and  Varshavsky,  1989),  using  primers  BGALS1 
(5 ' -CAGAGATCTCTTA ATCGCCTTGC AGCA-3 ' )  and  RSAS2  (5'-C- 
CCTCGAGGTCGACGGTATCG-3 '),  and  the  Expand  PCR  System 
(Boehringer,  Indianapolis,  IN).  The  BgllUXhol-cut  PCR  product  was 
ligated  to  the  insert-lacking,  P Cupi  promoter-containing  Bglll-Xhol 
fragment  of  pdUbXekUra3,  a  pRS314-based,  low  copy  plasmid  (F.Levy 
and  A.Varshavsky,  unpublished  data).  ORFs  that  expressed  DHFR-ha- 
UbR48-X-eK-pgal  containing  a  modified  eK  region  [single,  double  or 
triple  insertions  of  the  sequence  Lys-Arg-Lys  (KRK)  (Figure  1A)]  were 
produced  using  PCR.  Specifically,  pDhaUbXKlpgal,  pDhaUbXK2Pgal, 
PDhaUbXK3Pgal,  pDhaUbXKKlpgal  and  pDhaUbXKKKiPgal  were 
constructed  by  replacing  the  BamHl-Bglll  fragment  encoding  X-eK-pgal 
with  DNA  fragments  amplified  using,  respectively,  the  sense  primers 
EKKS 1  (5'-CACGGATCCAAGAGAAAGGGAGCTTGGCTGTTGC- 
CC-3'),  EKKS2  (5'-CACGGATCCGGAGCTTGGCTGAAGAGAAAG- 
TTGCCCGTCTC ACTGGTG-3 ' ),  EKKS3  (5 '  -CACGGATCCGGAG- 
CTTGGCTGTTGCCCGTCTCAAAGAGAAAGCTGGTGAAAAGAA- 
A  A  ACC-3'),  EKKS4  (5 '  -CACGGATCC  A  AGAGA  AAGGGAGCTTG- 
GCTGAAGAGAAAGTTGCCCGTCTCACTGGTG-3')  and  EKKKS1 
(5 '  -CACGGATCC  A  AGAGA  A  AGGGAGCTTGGCTG  A  AGAGAAAG- 
TTGCCCGTCTCAAAGAGAAAGCTGGTGAAAAGAAAAACC-3'). 
The  primer  EKAS1  (5'-GGAAGATCTCTGCATTAATGAATC-3')  and 
the  plasmid  pDhaUbMeKpgal  served  as  the  antisense  primer  and  the 
template,  respectively.  The  plasmids  pDhaUbRK3eApgal,  pDhaUbRK15- 
eAPgal  and  pDhaUbRK3K15eApgal,  which  expressed,  respectively, 
DHFR-ha-UbR48-Arg-Asn-Lys-(Asn)i2-eA-pgal,  DHFR-ha-UbR48-Arg- 
(Asn)  \  3-Lys-eA-pgal  and  DHFR-ha-UbR48- Arg- Asn-Ly s( Asn)  j  j  -Lys-eA- 
pgal,  were  constructed  as  follows.  The  term  eA  denotes  the  sequence 
HGS G AWLLP V S LVRS ,  a  13-residue  derivative  of  the  40-residue  eK 
(residues  2-14),  followed  by  two  residues  Arg  and  Ser,  encoded  by  the 
Bglll  site  (Figure  IB).  Double-stranded  oligonucleotides  that  encom¬ 
passed  the  5'  end  of  the  Sacll  site,  the  end  of  Ub-Arg-(Lys/ Asn)  14- 
coding  sequences  (see  below),  and  the  3'  end  of  the  BamHl  site  were 
digested  with  Sacll  and  BamHl.  The  resulting  fragments  were  ligated  to 
the  Sacll-BamHl  vector-containing  fragment  of  pDhaUbReApgal.  The 
latter  plasmid  was  produced  by  inserting  annealed  complementary 
oligonucleotides  EDKS1  and  EDAS1,  which  encoded  the  eA  sequence 
HGS  G  AWLLP  VSLVRS  (see  above),  into  BamHUB  gill-cut  pDhaUbReK- 
Pgal.  All  constructs  were  verified  by  nucleotide  sequencing. 

A  screen  for  N-degrons  in  the  lysine-asparagine  sequence 
space 

The  library  of  pDhaUbReApgal-derived  plasmids  was  constructed  that 
expressed,  from  the  P Cupi  promoter,  Arg-(K/N)]4-eA-pgal  proteins  that 
contained  random- sequence  14-residue  Lys/Asn(K/N)-inserts  between 
the  N-terminal  Arg  and  the  eA  moiety  (Figure  IB).  Arg-(K/N)14-eA-pgals 
were  the  products  of  co-translational  cleavage  of  DHFR-ha-UbR48-Arg- 
(K/N)14-eA  -pgals  (Figure  IB).  To  produce  the  library,  a  method  for 
cloning  random-sequence  oligonucleotides  produced  by  mutually  primed 
synthesis  was  used  (Oliphant  et  al,  1986;  Ghislain  et  al,  1996).  A  set 
of  oligonucleotides  5 '  -CGCCCGCGGTGGTAGG(A  AA/T) ,  jCACG- 
GATCCG-3'  that  contained  random  permutations  of  14  codons,  either 
AAA  (encoding  Lys)  or  AAT  (encoding  Asn),  flanked  by  the  Sacll  and 
BamHl  sites  (underlined),  was  synthesized.  The  oligonucleotides  were 
converted  into  their  double- stranded  counterparts  with  Klenow  Pol  I 
(Ausubel  et  al,  1996),  and  were  digested  with  Sacll  and  BamHl,  yielding 
a  set  of  equal-length  fragments  containing  randomly  permuted  (AAA/ 
T)i4  inserts.  The  fragments  were  ligated  to  the  Sacll-BamHl  vector- 
containing  fragment  of  pDhaUbReApgal.  The  resulting  library,  in  the 
plasmid  termed  pRKN14,  was  introduced  into  E.coli  DH5a  by  electropor- 
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ation  (Ausubel  et  al,  1996).  Digestion  of  the  pool  of  recovered  plasmids 
with  appropriate  restriction  enzymes  showed  that  >90%  of  transformants 
contained  an  oligonucleotide-derived  insert.  The  pRKN14  library  was 
transformed  into  S.cerevisiae  JD54,  in  which  Ubrlp  was  expressed  from 
the  P qali  promoter.  Approximately  2  X  104  transformants  growing  on 
SD(-Trp)  plates  containing  0.2  mM  CuS04  were  replica-plated  onto 
CuS04-containing  SG(-Trp)  plates.  The  XGal-based  filter  assay  (Ausubel 
et  al ,  1996)  was  used  to  screen  for  colonies  that  were  white  (low  levels 
of  pgal)  on  SG(-Trp)  plates  but  blue  (high  levels  of  pgal)  on  SD(-Trp) 
plates.  These  colonies  were  grown  up  in  liquid  SG(-Trp),  and  the  activity 
of  pgal  was  determined  as  described  below.  Plasmid  DNA  was  isolated 
from  the  positive  transformants,  amplified  in  E.coli  and  transformed 
into  S.cerevisiae  JD47-13C  (UBRI)  and  JD55  (ubrlA).  The  metabolic 
stabilities  of  the  corresponding  Arg-(K/N)14-eA-pgal  proteins  in  ubrlA 
versus  wild-type  {UBRI)  S.cerevisiae  were  determined  by  measuring 
pgal  activity,  and  also  directly,  by  carrying  out  pulse-chase  assays. 

Measurement  of  pgal  activity 

Saccharomyces  cerevisiae  cells  were  added  to  5  ml  of  SD(-Trp) 
containing  0.2  mM  CuS04,  and  grown  to  4600  ~L  Cells  were  gently 
pelleted  by  centrifugation,  lysed  with  glass  beads  in  20%  glycerol,  1  mM 
dithiothreitol,  0.1  M  Tris-HCl  pH  8,  and  the  activity  of  pgal  was 
measured  in  the  clarified  extract  using  o-nitrophenyl-p-D-galactoside,  as 
described  (Ausubel  et  al.,  1996).  The  activity  was  normalized  to  the  total 
protein  concentration,  determined  using  the  Bradford  assay  (Bio-Rad, 
Hercules,  CA). 

Pulse-chase  assays 

Transformed  JD47-13C  and  JD55  cells  from  10  ml  cultures  {Am  0.5-1) 
in  SD(-Trp)  containing  0.2  mM  CuS04  were  gently  pelleted  by  centrifu¬ 
gation,  and  washed  in  the  same  medium.  The  cells  were  resuspended  in 
0.3  ml  of  SD(-Trp)  containing  CuS04  and  labeled  for  5  min  at  30°C 
with  0.15  mCi  (5.5  MBq)  of  [35S]methionine/cysteine  (Trans35S-label, 
ICN,  Costa  Mesa,  CA).  The  cells  were  harvested  by  centrifugation, 
resuspended  in  0.3  ml  of  SD(-Trp),  10  mM  L-methionine,  0.5  mg/ml 
cycloheximide  and  incubated  further  at  30°C.  At  each  time  point,  0.1  ml 
samples  were  withdrawn  and  added  to  0.7  ml  of  the  lysis  buffer  (1% 
Triton  X-100,  0.15  M  NaCl,  1  mM  EDTA,  50  mM  Na-HEPES  pH  7.5) 
containing  1  mM  phenylmethylsulfonyl  fluoride.  The  cells  were  then 
lysed  by  vortexing  with  0.5  ml  of  0.5  mm  glass  beads  four  times  for 
1  min,  with  intermittent  cooling  on  ice,  followed  by  centrifugation  at 
12  000  g  for  10  min.  The  volumes  of  supernatants  were  adjusted  to 
equalize  the  amounts  of  10%  trichloroacetic  acid-insoluble  35S,  followed 
by  immunoprecipitation  with  a  mixture  of  saturating  amounts  of  mono¬ 
clonal  antibodies  against  the  ha  epitope  (Babco,  Berkeley,  CA)  and  pgal 
(Promega,  Madison,  WI).  The  samples  were  incubated  at  4°C  for  2  h, 
with  rotation,  followed  by  the  addition  of  10  pi  of  protein  A-Sepharose 
suspension  (Repligen,  Cambridge,  MA),  further  incubation  for  1  h,  and 
centrifugation  at  12  000  g  for  30  s.  The  immunoprecipitates  were  washed 
four  times  with  0.8  ml  of  the  lysis  buffer  plus  0.1%  SDS,  resuspended 
in  SDS-sample  buffer  (Ausubel  et  al,  1996),  heated  at  100°C  for  3  min 
and  fractionated  by  SDS-10%  PAGE,  followed  by  autoradiography  and 
quantitation  with  a  Phosphorlmager  (Molecular  Dynamics.  Sunnyvale, 
CA),  using  the  reference  provided  by  the  UPR  technique  (Figures  2 
and  5)  (Levy  et  al. ,  1996). 

Pulse-chase  assays  with  JD54  cells,  in  which  UBRI  was  expressed 
from  the  P GAU  promoter,  were  carried  out  by  growing  20  ml  cultures 
to  A6qq  0.5-1  in  SR(-Trp),  collecting  and  resuspending  the  cells  in 
0.5  ml  of  the  same  medium  containing  0.2  mM  CuS04  and  labeling 
with  0.3  mCi  of  [35S]methionine/cysteine  for  10  min  at  30°C.  The  cells 
were  washed  twice  with  SR(-Trp),  and  transferred  to  0.5  ml  of  SR 
(-Trp)  containing  10  mM  L-methioninc  and  1  mM  cysteine.  After  a 
further  20  min  incubation  at  30°C,  galactose  was  added  (to  a  final 
concentration  of  3%)  to  induce  the  expression  of  UBRI  and  initiate  the 
chase,  whose  time  points  are  indicated  in  Figures  2D  and  3D. 

ID5,  the  extent  of  initial  decay  at  the  end  of  the  5  min  pulse,  was 
calculated  as  follows:  ID5  -  {1  -  [Xpgal]0/[MeteKpgal]o}  X  100%. 
This  parameter  (the  upper  index  refers  to  the  length  of  pulse)  equals 
100%  minus  the  ratio  of  35S  in  an  X-pgal  to  35S  in  the  reference  protein 
dha-Ub  (DHFR-ha-UbR48)  at  the  end  of  the  pulse,  normalized  against 
the  same  ratio  with  metabolically  stable  Met-eK-Pgal  (Levy  et  al,  1996). 
To  denote  the  observed  half-lives  of  a  test  protein  at  different  regions 
of  a  non-exponential  decay  curve,  a  generalized  half-life  term  roTs  was 
used,  in  which  0.5  denotes  the  parameter’s  half-life  aspect  and  y-z 
denotes  the  relevant  time  interval,  from  y  to  z  min  of  chase. 
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The  N-end  rule  relates  the  in  vivo  half-life  of  a  protein  to  the  identity  of  its  N-terminal  residue.  The 
underlying  ubiquitin-dependent  proteolytic  system,  called  the  N-end  rule  pathway,  is  organized  hierarchically: 
N-terminal  aspartate  and  glutamate  (and  also  cysteine  in  metazoans)  are  secondary  destabilizing  residues,  in 
that  they  function  through  their  conjugation,  by  arginyl-tRNA-protein  transferase  (R-transferase),  to  arginine, 
a  primary  destabilizing  residue.  We  isolated  cDNA  encoding  the  516-residue  mouse  R-transferase,  ATElp,  and 
found  two  species,  termed  and  Atel-2.  Th eAtel  mRNAs  are  produced  through  a  most  unusual  alternative 

splicing  that  retains  one  or  the  other  of  the  two  homologous  129-bp  exons,  which  are  adjacent  in  the  mouse  Atel 
gene.  Human  ATE1  also  contains  the  alternative  129-bp  exons,  whereas  the  plant  (Arabidopsis  thaliana)  and  fly 
(Drosophila  melanogaster)  Atel  genes  encode  a  single  form  of  ATElp.  A  fusion  of  ATEl-lp  with  green  fluorescent 
protein  (GFP)  is  present  in  both  the  nucleus  and  the  cytosol,  whereas  ATEl-2p-GFP  is  exclusively  cytosolic. 
Mouse  ATEl-lp  and  ATEl-2p  were  examined  by  expressing  them  in  atelk  Saccharomyces  cerevisiae  in  the 
presence  of  test  substrates  that  included  Asp-pgal  (p-galactosidase)  and  Cys-pgal.  Both  forms  of  the  mouse 
R-transferase  conferred  instability  on  Asp-pgal  (but  not  on  Cys-Pgal)  through  the  arginylation  of  its  N- 
terminal  Asp,  the  ATEl-lp  enzyme  being  more  active  than  ATEl-2p.  The  ratio  of  Atel-1  to  Atel-2  mRNA  varies 
greatly  among  the  mouse  tissues;  it  is  —0.1  in  the  skeletal  muscle,  —0.25  in  the  spleen,  —3.3  in  the  liver  and 
brain,  and  -10  in  the  testis,  suggesting  that  the  two  R-transferases  are  functionally  distinct. 


The  half-lives  of  intracellular  proteins  range  from  a  few 
seconds  to  many  days.  The  rates  of  processive  proteolysis  are  a 
function  of  the  cell’s  physiological  state  and  are  controlled 
differentially  for  specific  proteins.  In  particular,  most  of  the 
damaged  or  otherwise  abnormal  proteins  are  metabolically 
unstable.  Many  other  proteins,  while  long-lived  as  components 
of  larger  macromolecular  structures  such  as  ribosomes  and 
oligomeric  proteins,  are  metabolically  unstable  as  free  sub¬ 
units.  Regulatory  proteins  are  often  also  short-lived  in  vivo, 
providing  a  way  to  generate  their  spatial  gradients  and  to 
rapidly  adjust  their  concentrations,  or  subunit  compositions, 
through  changes  in  the  rate  of  their  synthesis  or  degradation 
(20,  23,  28,  39,  44,  55). 

The  posttranslational  conjugation  of  arginine  (Arg)  to  the  N 
termini  of  eukaryotic  proteins  was  described  35  years  ago  (26), 
but  the  function  of  this  modification,  and  of  the  enzyme  in¬ 
volved,  Arg-tRNA-protein  transferase  (R-transferase)  (47),  re¬ 
mained  unknown  until  the  discovery  that  the  identity  of  N- 
terminal  residue  in  a  protein  influences  its  metabolic  stability 
(4).  The  resulting  relation  was  termed  the  N-end  rule  (54). 
Aspartate  (Asp)  and  glutamate  (Glu),  the  two  N-terminal  res¬ 
idues  known  to  be  arginylated  by  R-transferase  (47),  were 
shown  to  be  destabilizing  residues  in  the  N-end  rule  (4).  It  was 
therefore  proposed  (4)  that  the  function  of  R-transferase  is  to 
target  proteins  for  degradation  by  conjugating  Arg,  one  of  the 
primary  destabilizing  residues,  to  secondary  destabilizing  N- 
terminal  residues  (Asp  and  Glu  in  fungi;  Asp,  Glu,  and  Cys  in 
metazoans  (18)  (Fig.  1).  It  was  also  proposed  (4)  that  the 
analogous  prokaryotic  enzyme  Leu,  Phe-tRNA-protein  trans¬ 
ferase  (L,  F-transferase)  (47)  mediates  the  activity  of  N-termi- 
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nal  Arg  and  Lys,  which,  in  prokaryotes,  would  be  the  secondary 
destabilizing  residues.  These  conjectures  were  confirmed  (7, 
17,  45,  53). 

The  similar  but  distinct  degradation  signals  which  together 
give  rise  to  the  N-end  rule  are  called  the  N-degrons  (54,  56).  In 
eukaryotes,  an  N-degron  comprises  two  determinants:  a  desta¬ 
bilizing  N-terminal  residue  and  an  internal  Lys  residue  of  a 
substrate  (5,  22).  The  Lys  residue  is  the  site  of  formation  of  a 
substrate-linked  multiubiquitin  chain  (11).  The  N-end  rule 
pathway  is  thus  one  pathway  of  the  ubiquitin  (Ub)  system.  Ub 
is  a  76-residue  protein  whose  covalent  conjugation  to  other 
proteins  plays  a  role  in  a  multitude  of  processes,  including  cell 
growth,  division,  differentiation,  and  responses  to  stress  (20, 
23,  39,  55).  In  many  of  these  processes,  Ub  acts  through  routes 
that  involve  the  degradation  of  Ub-protein  conjugates  by  the 
26S  proteasome,  an  ATP-dependent  multisubunit  protease  (9, 
13,  40,  43). 

(In  the  text  that  follows,  names  of  mouse  genes  are  in  italics, 
with  the  first  letter  uppercase.  Names  of  human  and  Saccha¬ 
romyces  cerevisiae  genes  are  also  in  italics,  all  uppercase.  If 
human  and  mouse  genes  are  named  in  the  same  sentence,  the 
mouse  gene  notation  is  used.  Names  of  S.  cerevisiae  proteins 
are  roman,  with  the  first  letter  uppercase  and  an  extra  lower¬ 
case  “p”  at  the  end.  Names  of  the  corresponding  mouse  and 
human  proteins  are  the  same,  except  that  all  letters  but  the  last 
“p”  are  uppercase.  The  latter  usage  is  a  modification  of  the 
existing  convention  (50),  to  facilitate  simultaneous  discussions 
of  yeast,  mouse,  and  human  proteins.  In  some  citations,  the 
abbreviated  name  of  a  species  precedes  the  gene’s  name.) 

The  N-end  rule  is  organized  hierarchically.  In  the  yeast  S . 
cerevisiae ,  Asn  and  Gin  are  tertiary  destabilizing  N-terminal 
residues  in  that  they  function  through  their  conversion,  by  the 
NTA1 -encoded  N-terminal  amidase  (Nt-amidase)  (6),  to  the 
secondary  destabilizing  N-terminal  residues  Asp  and  Glu.  The 
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FIG.  1.  Comparison  of  enzymatic  reactions  that  underlie  the  activity  of  the 
tertiary  and  secondary  destabilizing  residues  among  eukaryotes.  (A)  Mammals 
(reference  54  and  this  work);  (B)  the  yeast  S.  cerevisiae  (5).  N-terminal  residues 
are  indicated  by  single-letter  abbreviations  for  amino  acids;  ovals  denote  the  rest 
of  a  protein  substrate.  The  Ntanl  -encoded  mammalian  Nt-amidase  converts 
N-terminal  Asn  to  Asp,  whereas  N-terminal  Gin  is  deamidated  by  a  distinct 
Nt-amidase  that  remains  to  be  identified  (19,  51).  In  contrast,  the  yeast  Nt- 
amidase  can  deamidate  either  N-terminal  Asn  or  Gin  (6).  The  secondary  desta¬ 
bilizing  residues  Asp  and  Glu  are  arginylated  by  the  mammalian  ATEl-lp  or 
ATEl-2p  R-transferase  (see  Results).  A  Cys-specific  mammalian  R-transferase 
remains  to  be  identified  (see  Results).  N-terminal  Arg,  one  of  the  primary 
destabilizing  residues  (54),  is  recognized  by  N-recognin  (E3)  (see  the  introduc¬ 
tion). 


destabilizing  activity  of  N-terminal  Asp  and  Glu  requires  their 
conjugation,  by  the  S.  cerevisiae  ATE1  -encoded  Arg-tRNA- 
protein  transferase  (R-transferase),  to  Arg,  one  of  the  primary 
destabilizing  residues  (7)  (Fig.  IB).  In  mammals,  the  deami¬ 
dation  step  is  bifurcated,  in  that  two  distinct  Nt-amidases  spe¬ 
cific,  respectively,  for  N-terminal  Asn  and  Gin,  mediate  the 
activity  of  tertiary  destabilizing  residues  (19,  51)  (Fig.  1A). 
Mice  lacking  the  Asn-specific  Nt-amidase  NTANlp  have  re¬ 
cently  been  produced  through  targeted  mutagenesis  and  found 
to  be  fertile,  outwardly  normal,  but  behaviorally  distinct  from 
their  congenic  wild-type  counterparts  (28a).  In  mammals,  the 
set  of  secondary  destabilizing  residues  contains  not  only  Asp 
and  Glu  but  also  Cys,  which  is  a  stabilizing  residue  in  yeast  (18, 
54)  (Fig.  1). 

The  primary  destabilizing  N-terminal  residues  are  bound 
directly  by  the  UBRl-e ncoded  N-recognin  (also  called  E3ot), 
the  recognition  component  of  the  N-end  rule  pathway  (8).  In  S. 
cerevisiae ,  N-recognin  is  a  225-kDa  protein  that  binds  to  po¬ 
tential  N-end  rule  substrates  through  their  primary  destabiliz¬ 
ing  N-terminal  residues,  Phe,  Leu,  Trp,  Tyr,  lie,  Arg,  Lys,  and 
His  (54).  The  Ubrl  genes  encoding  mouse  and  human  N- 
recognins,  also  called  Ea  (21,  41),  have  been  cloned  (29),  and 
mouse  strains  lacking  Ubrl  have  recently  been  constructed 
(29a). 

The  known  functions  of  the  N-end  rule  pathway  include  the 
control  of  peptide  import  in  S.  cerevisiae,  through  the  degra¬ 
dation  of  Cup9p,  a  transcriptional  repressor  of  PTR2  which 
encodes  the  peptide  transporter  (2, 10);  a  role  in  regulating  the 
Slnlp-dependent  phosphorylation  cascade  that  mediates  os¬ 
moregulation  in  S.  cerevisiae  (38);  the  degradation  of  Gpalp,  a 
Ga  protein  of  S.  cerevisiae  (34);  and  the  conditional  degrada¬ 
tion  of  alphaviral  RNA  polymerase  in  virus-infected  metazoan 
cells  (16,  54).  Physiological  N-end  rule  substrates  were  also 
identified  among  the  proteins  secreted  into  the  host  cell’s  cy¬ 
tosol  by  intracellular  parasites  such  as  the  bacterium  Listeria 
monocytogenes  (46).  Inhibition  of  the  N-end  rule  pathway  was 
reported  to  interfere  with  mammalian  cell  differentiation  (24) 
and  to  delay  limb  regeneration  in  amphibians  (52).  Microar¬ 
ray-based  comparisons  of  gene  expression  patterns  in  wild-type 
and  congenic  ubrl  A  strains  of  S .  cerevisiae  have  shown  that  a 
number  of  yeast  genes,  of  diverse  functions,  are  significantly 


up-  or  down-regulated  in  the  absence  of  the  N-end  rule  path¬ 
way  (39a). 

The  mammalian  counterpart  of  the  yeast  ATE1 -encoded 
R-transferase  was  partially  purified  from  rabbit  reticulocytes 
and  shown  to  cofractionate  with  Arg-tRNA  synthetase  (12). 
Recent  studies  of  the  Ub-dependent  proteolysis  of  endogenous 
proteins  in  muscle  extracts  suggested  that  the  N-end  rule  path¬ 
way  plays  a  major  role  in  catabolic  states  that  result  in  muscle 
atrophy  (48,  49).  A  significant  fraction  of  the  N-end  rule  path¬ 
way’s  activity  in  muscle  extracts  was  found  to  be  tRNA  depen¬ 
dent,  indicating  the  involvement  of  R-transferase  (48,  49).  It 
was  also  reported  that  a  crush  injury  to  the  rat  sciatic  nerve 
results  in  a  ~  10-fold  increase  in  the  rate  of  arginine  conjuga¬ 
tion  to  the  N  termini  of  unidentified  proteins  in  the  nerve’s 
region  upstream  of  the  crush  site  (15,  57),  suggesting  an  injury- 
induced  increase  in  the  concentration  of  R-transferase  sub¬ 
strates  and/or  an  enhanced  activity  of  the  N-end  rule  pathway. 

In  this  work,  we  began  the  functional  analysis  of  mammalian 
R-transferase  (ATElp)  by  isolating  mouse  cDNA  encoding 
this  enzyme.  Surprisingly,  we  found  two  Atel  cDNA  species, 
which  were  identical  except  for  a  129-bp  region  that  encoded 
similar  but  distinct  sequences.  One  or  the  other,  but  not  both, 
of  the  corresponding  Atel  exons  is  retained  in  the  mature  Atel 
mRNA,  and  the  ratio  of  the  resulting  two  species,  Atel-1  and 
Atel -2,  varies  greatly  among  mouse  tissues.  We  also  show  that 
ATEl-lp  and  ATEl-2p,  while  differing  in  activity,  can  argin- 
ylate  N-terminal  Asp  and  Glu  in  model  substrates.  However, 
neither  of  them  can  arginylate  N-terminal  Cys,  the  known 
secondary  destabilizing  residue  (18),  suggesting  the  existence 
of  a  distinct  Cys-specific  mammalian  R-transferase. 

MATERIALS  AND  METHODS 

Strains  and  plasmids.  The  S.  cerevisiae  strains  used  were  JD55  (MATa  ura3-52 
his3-A200  Ieu2-3J12  trpl-A63  lys2-801  ubrl  A:\HIS3)  (34)  and  SGY3  {MATa 
ura3-52  lys2-801  ade2-101  trpl-A63  his3-A200  Ieu2-Alatel-A2::LEU2 )  (19a).  Cells 
were  grown  in  rich  medium  (YPD)  or  in  synthetic  media  (SD)  containing  0.67% 
yeast  nitrogen  base  without  amino  acids  (Difco),  auxotrophic  nutrients,  and  2% 
glucose.  To  induce  the  VGAL  promoter,  glucose  was  replaced  by  2%  galactose 
(SG  medium).  Transformation  of  S.  cerevisiae  was  performed  by  the  lithium 
acetate  method  (3). 

The  ubrl  A  atel  A  double  mutant  AVY34  was  constructed  by  replacing  93%  of 
the  ATE1  open  reading  frame  (ORF)  (the  first  470  codons)  in  strain  JD55 
(ubrl  A)  by  the  LEU2  gene,  through  homologous  recombination  (42)  with  the 
introduced  LEU2  gene  flanked  on  either  side  by  40  bp  of  ATE  1 -specific  se¬ 
quences.  Mutants  were  selected  on  SD  (lacking  Leu  and  His)  plates,  and  Leu+ 
isolates  were  checked  by  PCR  for  the  absence  of  ATE  1  and  by  colony  assays  (7, 
8)  for  the  absence  of  Atelp  and  Ubrlp  activity.  High-copy-number  pUB23-X 
plasmids  expressing  Ub-X-pgal  proteins  (see  below)  from  PGAL  in  S.  cerevisiae 
have  been  described  elsewhere  (4).  Mous e  Atel-1  and  Atel-2  cDNAs  (see  below) 
were  subcloned  into  the  low-copy-number  vector  p414GALl  (36),  using  the 
engineered  BamHl  (5')  and  Xhol  (3')  restriction  sites,  yielding  plasmids  pATl 
and  pAT2.  For  localization  assays  with  green  fluorescent  protein  (GFP),  cDNAs 
encoding  mouse  ATEl-lp  or  ATEl-2p  were  subcloned  into  the  pEGFP-Nl 
N-terminal  protein  fusion  vector  (Clontech,  Palo  Alto,  Calif.),  using  the  engi¬ 
neered  Xhol  (5')  andAgel  (3')  restriction  sites,  yielding  plasmids  pATl-GFP  and 
pAT2-GFP. 

Isolation  of  the  mouse  Atel-1  and  Atel-2  cDNAs.  The  392-bp  fragment  of  the 
mouse  EST  (expressed  sequence  tag)  clone  (accession  no.  AA415294),  which  was 
identified  in  GenBank  through  species  walking  (see  Results),  was  used  as  a  probe 
to  screen  a  kgtlO-based  mouse  cDNA  library  from  MEL-C19  cells  (Clontech), 
using  standard  procedures  (3).  Eight  positive  clones,  whose  inserts  ranged  from 
0.5  to  1.6  kb,  were  analyzed  by  PCR  and  partial  sequencing.  The  cDNA  inserts 
of  clones  3  and  8  were  then  subcloned  into  pBluescript  II  SK+  (29)  and  se¬ 
quenced  on  both  strands.  The  resulting  ORFs  were  identical  except  for  a  129-bp 
internal  region  (see  Results)  (Fig.  2A).  The  deduced  amino  acid  sequences  of  the 
mouse  cDNA  clones  3  and  8  were  weakly  but  significantly  similar  to  the  deduced 
sequences  of  Caenorhabditis  elegans  and  S.  cerevisiae  ATElp  (see  Results)  (Fig. 
2B)  and  corresponded  to  nucleotides  (nt)  699  to  1870  and  587  to  2099,  respec¬ 
tively,  in  the  subsequently  produced  full-length  mouse  Atel-1  and  Ate  1-2  cDNAs 
(accession  no.  AF079096  and  AF079097). 

A  human  EST  clone  (accession  no.  AA503372)  whose  deduced  amino  acid 
sequence  was  highly  similar  to  that  of  the  partial  mouse  Atel  cDNA  clone  8  was 
found  in  GenBank,  using  the  partial  mouse  ATElp  sequence  as  a  query.  This 
EST  clone  was  purchased  from  Genome  Systems  (St.  Louis,  Mo.)  and  sequenced 
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A  Atel  aaaaatccaaaaggaggaaacacgagaaacataagcataactctccatctggtgatgatagttctgaccgaaggcggcggtagcggcggctgcgtaggactgggtggcgcagcggcggcg  120 
Atel  GGGTCGCGGACATGGCTTCTTGGAGCGCGCCTTCACCTAGCCTTGTAGAGTATTTTGAGGGCCAGACCTCCTTCCAG^GTGGCTAC|GCAAGAACAAGTTGGGCAGTCGCTCCTATGGCA  240 

Atel  -1  TGTGGGCACATTCCATGACAGTGCAGGATTATCAGGATCTTATAGACCGAGGATGGAGAAGAAGTGGGAAATATGTGTACAAACCTGTCATGGATCAAACAJGC^GTCCTCAGTATACAA  360 

Atel  -1  TAAGGTGTCATCCTTTACAGTTTCAGCCATCAAAATCTCACAAGAAAGTTTTGAAAAAAATGCTGAAATTTCTGGCTAAAGGAGAGATCTCGAAAGGCAATTGTGAGGATGAGCCCATGG  4  80 
RCHPLQFQPSKSHKKVLKKMLKFLAKGEISKGNCEDEPMD  117 

Atel-1  ATTCTACAGTGGAGGATGCTGTTGACGGTGACTTTGCATTAATTAACAAGCTGGATATAAAGTGTGATCTCAAAACACTCAGTGACCTCAAAGGAAGCATAGAGAGTGAAGAGAAGGAGA  600 
STVEDAVDGDFALINKLDIKCDLKTLSDLKGSIESEEKEK  157 


Atel-1  AAGAAAAGAGTATAAAGAAAGAAGGGTCTAAAGAATTCATTCATCCACAATCTATAGAGGAGAAGTTGGGCTCTGGTGAACCATCACATCCAATCAAAGTTCATATTGGTCCTAAGCCAG  720 
EKSIKKEGSKEFIHPQSIEEKLGSGEPSHPIKVHIGPKPG197 

Atel-1  GCAAAGGGGCTGACTTGAGTAAGCCTCCATGTCGGAAAGCAAGGGAAATGAGGAAAGAAAGGCAAAGATTAAAACGGATGCAGCAGGCCTCAGCTGCAGCCTCGGAGGCTCAAGGTCAGC  840 
KGADLSKPPCRKAREMRKERQRLKRMQQASAAASEAQGQP  237 

0  0  L  0  P  A  0  F 

Atei-2  _ GTGAGGCTAGTACCTG CCTCCT 

Atel-1  CAGTCTGTTTGTTACCAAAGGCTAAATCCAACCAGCCCAAGTCACTGGAAGATTTGATTTTTCAATCTTTACCAGAAAATGCATCGCACAAGTTAGAGGTGAGGGTGGTAAGATCTTCTC  960 
V  C  L  L  P  K  A  K  S  N  Q  P  K  S  L  E  D  L  I  F  Q  S  L  P  E  N  A  S  H  K  L  E  Q  0  VI  R  S  0  P  277 

E  D  Q  E  0  N  S  S  0  N  Q  0  F  S  L  0  T  K  0  0  V  A  |  0  Q  E  A  0  E;  I  C  E  K  0  B 

Atel -2  TTGAGGACCCAGAGTTCAACTCATCCTTCAACCAGTCCTTTTCTTTATATACCAAGTATCAAGTGGCTATACACCAGGAAGCACCTGAAATATGTGAAAAGTCTGAG - 

Atel-1  CACCAAGTCCTCAGTTCAGAGCCACATTTCAGGAGTCTTACCAGGTCTATAAACGCTACCAGATGGTTGTTCACAAGGACCCGCCTGATAAGCCAACCGTGAGCCAGTTTACAAGATTCC  1080 

P  S  □  Q  0  R  A  T  0  Q  E  0  yi  Q  y  0  K  R  0  □  M  V  V  B  K  D  P  Q  D  K  P  T  V  0  Q  A  Q  G  Q  P  317 

Atel-1  TTTGCAGCTCACCATTGGAGGCAGAGCACCCTGCTGATGGACCAGAATGTGGTTATGGCTCCTTTCACCAGCAGTACTGGCTCGATGGGAAGATCATTGCTGTGGGGGTGTTAGACATTC  1200 
CSSPLEAEHPADGPECGYGSFHQQYW_LD  _G__K  I  I  A_  V  _G  V  L  D  I  L  357 

Atel-1  TCCCGTACTGTGTGTTTTCTGTGTATCTCTACTACGATCCTGATTATTCATTTCTGTCTTTGGGTGTCTATTCAGCATTAAG  AGAAATTGCTTTTACTAGACAACTGCATGAG  AAAACAT  1320 
PYCVFSVYLYYDPDYSFLSLGVYSALREIAFTRQLHEKTS  397 

Atel-2 _ — — _ .  -  — — , _ -  - _ _ _ — _ -  — — — — - - - - -  -  - —  -  - - — —  —  —  — 

Atel-1  CGCAACTCAGCTATTATTATATGGGTTTCTACATTCATTCCTGTCCCAAGATGAGATACAAGGGTCAATATAGACCTTCTGATTTGCTGTGTCCTGAGACGTATGTCTGGGTGCCCATTG  14  40 

QLSYYYMGFYIHSCPKMRYKGQYR  PSDL  LCPET  _ Y__!L-? _ ?  437 

A  t  e  1  ”  1  AGCAGTGCCTGCCTTCTCTGG  ACAACTCC  AAGTACTGCCGTTTC  AACCAGG  ACCCCGAAGCAGAGGATGAAGG  ACGCAGTAAGG  AACTTGACCGACTAAGGGTGTTTCACAGACGATCTG  1560 
QCLPSLDNSKYCRFNQDPEAEDEGRSKELDRLRVFHRRSA  477 

Atel-1  CCATGCCTTACGGTGTTTATAAGAATCACCAAGAAGACCCAAGTGAGGAGGCTGGTGTGCTGGAGTATGCAAACCTCGTAGGACAGAAGTGCTCGGAGAGGATGCTGCTGTTCAGACACT  1680 
MPYGVYKNHQ_EDPSEEAGVLEYANLVGQKCSERMLLFRH*516 

Atel-1  GAAGGCCTGCACCTCAGCCCAGCGCCAGCCTGGCATCTCTTGCCACAGTCCCACCTGCTTCTCGGCTCCTGCAGCCATTCACCAAGTACATTCGTAGAGCTTTTTAAAAATATTTTGTGG  1800 
AATTGTATTTATGAGAATCTTGTGGTTAATATAAAGATTTTAAAATATTCTATGACCTAGCCCTTTAATTGGGACTTGTCCTTTGC  AGCTTGCTTCTGG  AGCAAGGTGTGCTTCATGTCT  1920 
CC AGTGAATATTTAGGG AAGG AAAATGAAG AAGTCAATATAAGTCATTTTGTCACAGTTACTGTTCTGTGCTGGAGAAATTCTGAC ACAGAGACTGTTCTTTTAGACC AATGGTTCTTAA  204  0 
CCTGTGGGCTCACCACACTC ACAGGCCGAATGTTAGATAGCCTGTGTGAGATCAACTC A  2099 
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to  obtain  more  of  the  5 '-proximal  human  ATE1  sequence.  Reverse  transcription- 
PCR  (RT-PCR)  (3)  was  then  carried  out  with  poly(A)+  RNA  from  mouse 
embryonic  fibroblasts,  using  the  mouse  Atel -specific  reverse  primer  5'-CCTTT 
GGTAACAAACAGACTGGCTG-3 '  and  the  forward  primer  5'-TCTCATAG 
ACCGAGGATGGCGAAG-3',  whose  sequence  was  derived  from  the  above 
human  EST  clone.  The  resulting  PCR  products,  which  appeared  as  a  smear  upon 
agarose  gel  electrophoresis,  were  ligated  into  the  TA  cloning  vector  (Invitrogen, 
San  Diego,  Calif.),  and  the  ligation  mixture  was  used  as  a  template  for  PCR  using 
a  nested  mouse  Atel  primer  5'-CTGCAGCTGAGGCCTGCTGCATCCG-3' 
and  a  vector-specific  primer  5 ' -GTTTT CCCAGTC  ACGAC-3 ' .  This  strategy 
yielded  a  single  major  DNA  species  (data  not  shown).  We  then  applied  5 '-RACE 
(rapid  amplification  of  5'  cDNA  ends)  (3),  using  the  above  RT-PCR-derived 
sequence,  to  produce  the  full-length  Atel  cDNA  as  previously  described  (19). 

Analysis  of  the  mouse  Atel  gene.  The  mouse  genomic  DNA  from  L  cells  was 
used  as  a  template  for  PCR,  using  the  Expand  high-fidelity  PCR  system  (Boehr- 
inger,  Indianapolis,  Ind.)  and  exon-specific  primers  as  previously  described  (29), 
to  produce  DNA  fragments  that  together  spanned  ~4  kb  of  the  mouse  Atel  gene 
and  contained  the  two  alternative  129-bp  Atel  exons.  The  regions  encompassing 
exon/intron  junctions  were  sequenced  by  using  exon-  and  intron-specific  primers. 
Thereafter,  a  strategy  described  earlier  for  the  Ubrl  gene  (29)  was  used  to  screen, 
using  a  fragment  of  the  mouse  Atel  cDNA  (nt  255-1139),  a  BAC  (bacterial 
artificial  chromosome)-based  library  of  mouse  genomic  DNA  fragments  from 
strain  129SvJ  (Genome  Systems),  yielding  one  BAC  clone  containing  the  mouse 
Atel  gene. 

Isolation  of  the  human,  plant,  and  fly  ATE1  cDNAs.  Using  the  cloned  mouse 
ATElp  sequences  (see  above)  as  queries,  we  identified  in  GenBank  several 
significantly  similar  EST  sequences  from  other  organisms  (data  not  shown).  To 
determine  whether  these  species  also  contained  the  two  forms  of  ATE  1  mRNA, 
we  isolated  the  corresponding  A TE1  cDNAs.  RT-PCR  (3)  with  poly(A)+  RNA 
from  human  293  cells  and  the  primers  5'-CAATGGCATGTGGGCACATTCC 
ATG-3'  (specific  for  the  human  EST  clone  AA503372  [see  above])  and  5'-CC 
ACAGGTACTGAATATGTATCCTG-3'  (specific  for  the  human  EST  clone 
AA1 95361)  was  carried  out,  yielding  a  1.6-kb  human  ATE1  cDNA  fragment 
lacking  the  first  41  codons  of  th eATEl  ORF.  This  fragment  (a  mixture  of  the  two 
alternative  cDNAs)  was  subcloned  into  the  TA  vector  (Invitrogen)  and  se¬ 
quenced  on  both  strands.  Full-length  Atel  cDNAs  from  Arabidopsis  thaliana  and 
Drosophila  melanogaster  were  isolated  by  RT-PCR  as  well,  using  total  RNA  from 
A.  thaliana  leaves,  poly(A)+  RNA  from  D.  melanogaster  embryos,  and  primers 
specific  for  the  5'  and  3'  ends  of  the  corresponding  ORFs.  By  using  the  strategy 
described  above  for  the  human  ATE1  cDNAs,  the  sequences  of  these  primers 
were  derived  from  the  EST  clones  that  were  initially  identified  in  GenBank  through 
their  similarity  to  the  mouse  ATElp  sequence,  then  purchased  from  Genome  Sys¬ 
tems,  and  sequenced  prior  to  RT-PCR  with  the  corresponding  RNA  preparations. 
The  final  human,  plant,  and  fly  ATE1  cDNAs  were  sequenced  on  both  strands. 

Assays  of  p-gal.  Colony  assays  for  the  Escherichia  coli  {3-galactosidase  (Pgal) 
in  5.  cerevisiae  were  carried  out  by  overlaying  yeast  colonies  on  SG  plates  with 
0.5%  agarose  containing  0.1%  sodium  dodecyl  sulfate  (SDS),  4%  dimethylfor- 
mamide,  and  a  0.1-mg/ml  solution  of  the  chromogenic  |3gal  substrate  X-Gal 
(5-bromo-4-chloro-3-indolyl-(3-D-galactopyranoside;  Calbiochem,  La  Jolla,  Cal¬ 
if.),  followed  by  incubation  for  1  to  2  h  at  37°C.  Quantitative  assays  for  pgal  in 
S.  cerevisiae  were  carried  out  with  whole-cell  extracts,  using  another  chromogenic 
Pgal  substrate,  o-nitrophenyl-fi-D-galactopyranoside  (ONPG).  Cells  in  a  5-ml 
culture  (A6 00  of  ~1)  were  pelleted  by  centrifugation  and  resuspended  in  5  ml  of 
buffer  Z  (60  mM  Na2HP04,  40  mM  NaH2P04,  10  mM  KC1,  1  mM  MgS04,  50 
mM  (3-mercaptoethanol  [pH  7.0]).  After  the  A600  of  the  suspension  was  deter¬ 
mined  50-  or  100-jxl  samples  were  diluted  to  1  ml  with  buffer  Z;  0.1%  SDS  (20 
|xl)  and  CHC13  (50  pi)  were  then  added;  the  suspension  was  vortexed  for  10  to 
15  s  and  incubated  for  15  min  at  30°C,  followed  by  the  addition  of  200  pi  of 
ONPG  (4  mg/ml  in  buffer  Z)  and  further  incubation  at  30°C,  until  a  medium 
yellow  color  had  developed,  at  which  point  the  reaction  was  stopped  by  the 
addition  of  1  M  Na2C03  (0.4  ml).  The  mixture  was  centrifuged  for  5  min  at 
1,100  x  g,  and  they4420  andv4500  of  the  samples  were  measured.  The  ONPG  units 


(Uqnpg)  °f  Pgal  activity  were  calculated  as  follows:  Uqnpg  =  ls000  X  [(■^420) 
(1.75  X  A500)]/f  X  v  X  A600,  where  t  and  v  were,  respectively,  the  time  of 
incubation  (minutes)  and  the  sample  volume  (milliliters)  (3). 

Purification  and  N-terminal  sequencing  of  X-Pgal  proteins.  Extracts  were 
prepared  (using  the  liquid  nitrogen  procedure  [3])  from  S.  cerevisiae  AVY34 
{ubrl  A  atel  A)  co  transformed  with  a  pUB23-X  plasmid  (4)  (expressing  Ub-X- 
Pgal)  and  either  pATl  (expressing  mouse  ATEl-lp)  or  pAT2  (expressing  mouse 
ATEl-2p).  Cultures  were  grown  in  SG  to  an  A600  of  ~1.  Specific  X-pgal  proteins 
(X  =  Asp  or  Cys)  were  purified  by  affinity  chromatography  on  ProtoSorb  lacZ 
(Promega,  Madison,  Wis.),  a  monoclonal  anti-pgal  antibody  coupled  to  agarose 
beads  (Promega).  X-pgal  proteins  were  further  purified  by  electrophoresis  on 
SDS-7%  polyacrylamide  gels  and  were  electroblotted  onto  Immobilon-PSQ 
membranes  (Millipore,  Bedford,  Mass.).  N-terminal  sequencing  of  10  to  15  pmol 
of  electroblotted  X-f3gal  was  carried  out  for  at  least  five  cycles,  using  an  Applied 
Biosystems  476A  protein  sequencer  (Caltech  Micro  chemistry  Facility). 

Mouse  cell  cultures,  transfection,  and  GFP  localization.  NIH  3T3  cells  (ATCC 
1658-CRL)  were  grown  as  monolayers  in  Dulbecco’s  modified  Eagle  medium 
(GIBCO,  Frederick,  Md.)  supplemented  with  10%  fetal  bovine  serum.  Cells  for 
GFP  localization  analyses  were  grown  to  —15%  confluence  on  glass  coverslips 
for  24  h  prior  to  transfection  with  either  pATl-GFP  or  pAT2-GFP,  using  Lipo- 
fectamine  (GIBCO)  and  the  manufacturer-supplied  protocol.  Cells  were  incu¬ 
bated  for  5  h  at  37°C  in  serum-free  medium  containing  DNA  and  Lipofectamine. 
Thereafter  an  equal  volume  of  medium  containing  20%  serum  was  added,  and 
the  cells  were  grown  for  another  12  to  20  h  at  37°C.  Cells  were  fixed  with  2% 
formaldehyde  in  phosphate-buffered  saline,  and  GFP  fluorescence  was  visualized 
in  a  Zeiss  Axiophot  microscope. 

Northern  hybridization.  Mouse  multiple-tissue  Northern  blots  containing  2  fxg 
of  poly(A)+  RNA  per  lane  (Clontech)  were  probed  with  the  32P-labeled  1.1-kb 
mouse  Atel  cDNA  (nt  638  to  1734),  using  the  manufacturer-supplied  protocol. 

Determination  of  the  relative  levels  of  Atel-1  and  Atel-2  mRNAs.  Samples  of 
total  RNA  isolated  as  described  previously  (3)  from  mouse  spleen,  skeletal 
muscle,  liver,  brain,  testis,  and  embryonic  fibroblasts  were  subjected  to  RT-PCR 
(28  cycles).  The  primers  5 ' -C AGTGG AGG ATGCTGTT GACG GTGAC-3 '  and 
5'-GTGCTCTGCCTCCAATGGTGAGCTG-3'  were  specific  for  the  identical 
regions  of  Atel-1  and  Atel-2  cDNAs  that  flanked  the  two  129-bp  exons  (see 
Results)  which  distinguished  these  cDNAs.  The  resulting  624-bp  product  (a 
mixture  of  the  Atel-1  and  Atel-2  cDNA  fragments)  was  treated  with  ScrFI,  which 
cuts  at  different  sites  within  the  two  129-bp  exons,  followed  by  a  2%  agarose  gel 
electrophoresis.  This  procedure  made  it  possible  to  distinguish  the  Atel-1  and 
Atel-2  fragments.  The  ratios  of  the  two  forms  of  Atel  cDNA  were  determined  by 
serial  dilutions  of  the  samples  prior  to  gel  electrophoresis. 

Nucleotide  sequence  accession  numbers.  The  nucleotide  sequences  reported 
in  this  paper  were  submitted  to  the  GenBank/EMBL  data  bank  and  assigned 
accession  no.  AF079096  (mouse  Atel-1  cDNA),  AF079097  (mouse  Atel-2  cDNA), 
AF079098  (human  Atel-1  cDNA),  AF079099  (human  ATE1-2  cDNA),  AF079100 
(A.  thaliana  Atel  cDNA),  and  AF079101  (D.  melanogaster  Atel  cDNA). 


RESULTS 

Identification  of  mouse  Atel  cDNAs  by  species  walking.  On 

the  assumption  that  the  sequences  of  R-transferases  in  differ¬ 
ent  species  might  be  sufficiently  conserved  to  be  detected  by 
using  the  sequence  of  the  only  cloned  R-transferase,  S.  cerevi¬ 
siae  Atelp  (7),  we  have  been  searching  GenBank  and  related 
databases.  No  mammalian  sequences  in  GenBank,  including 
the  EST  sequences,  had  significant  similarities  to  S.  cerevisiae 
Atelp.  However,  we  did  identify  a  nematode  (C.  elegans )  ORF 
(accession  no.  Z21146)  that  exhibited  similarity  to  yeast  Atelp 
(Fig.  2B)  and  then  used  the  C.  elegans  sequence  to  identify  a 


FIG.  2.  Two  forms  of  the  mouse  Atel  cDNA  and  the  ATE  protein  family.  (A)  The  mouse  Atel-1  md  Atel-2  cDNAs  and  their  products.  The  nucleotide  sequences 
of  Atel-2  identical  to  those  of  Atel-1  (everywhere  except  for  the  129-bp  region)  are  indicated  by  dashes.  In  the  region  of  the  alternative  129-bp  exons  of  Atel-1  and 
Atel-2,  white-on-black  and  gray  shadings  highlight,  respectively,  identical  and  similar  residues.  The  circled  Cys  residues  are  homologous  to  those  that  are  important 
for  the  enzymatic  activity  of  S.  cerevisiae  Atelp  (32).  (B)  The  ATE  protein  family  and  the  origins  of  the  alternative  129-bp  exons.  Alignment  of  the  sequences  of  mouse 
ATEl-lp  (Mm-Atel-1),  N.  thaliana  Atelp  (At-Atel),  D.  melanogaster  Atelp  (Dm-Atel),  C.  elegans  Atelp  (Ce-Atel),  and  S.  cerevisiae  Atelp  (Sc-Atel)  (accession  no. 
J05404).  Similar  residues  (gray)  were  grouped  as  follows:  M,  L,  I,  and  V;  D,  E,  N,  and  Q;  R,  K,  and  H;  Y,  F,  and  W;  S,  A,  and  T.  The  region  encoded  by  the  alternative 
129-bp  exons  of  mouse  Atel  is  highlighted  by  a  thick  line.  Of  the  Cys  residues  that  are  conserved  among  all  ATE  proteins,  the  ones  required  and  not  required  for  the 
enzymatic  activity  of  5.  cerevisiae  Atelp  (32)  are  indicated,  respectively,  by  T  and  V.  The  N-terminally  truncated  mouse  ATEl-lp  and  ATEl-2p  proteins  which  began 
at  Met-42  (•)  lacked  the  R-transferase  activity  (data  not  shown).  The  highly  variable  C-terminal  regions  of  ATE  proteins  were  omitted  from  the  alignment.  The 
sequences  were  aligned  using  PileUp  program  (Wisconsin  Package;  Genetics  Computer  Group,  Madison,  Wis.).  Gaps  (— )  were  introduced  to  optimize  the  alignment. 
The  residue  numbers  are  on  the  right  of  the  sequences.  The  sequence  of  C.  elegans  Atelp  appears  to  lack  the  N-terminal  region  of  other  ATE  proteins  because  of  an 
error  in  defining  th t  Atel  ORF  in  the  genomic  DNA  sequence  (accession  no.  Z21146).  (C)  Alignment  of  the  43-residue  regions  that  are  encoded  by  the  alternative 
129-bp  exons  in  mammalian  Atel.  Sequences  shown:  mouse  (Mm-Atel-1  and  Mm-Atel-2),  human  (Hs-Atel-1  and  Hs-Atel-2),  D.  melanogaster  (Dm-Atel),  C.  elegans 
(Ce-Atel),  A.  thaliana  (At-Atel),  S.  pombe  (Sp-Atel;  accession  no.  Z99568),  and  S.  cerevisiae  (Sc-Atel)  (accession  no.  J05404).  The  degrees  of  identity  and  similarity 
of  ATE  proteins  to  the  deduced  amino  acid  sequences  of  the  M8  (mouse  ATEl-lp)  or  M3  (mouse  ATEl-2p)  exon  of  the  mouse  Atel  gene  are  indicated  on  the  right. 
The  residues  conserved  among  all  of  the  compared  sequences  are  indicated  above  the  alignment. 
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FIG.  3.  Expression  o f  Atel-1  and  AteJ-2  mRNAs  in  different  mouse  tissues 
and  mouse  embryonic  fibroblasts  (MEF).  Two  forms  of  Atel  cDNA  (Fig.  2A) 
were  amplified  in  a  single  reaction  by  RT-PCR,  using  the  same  primers,  to  yield 
624-bp  fragments  that  included  the  region  of  the  alternative  129-bp  exons  (see 
Materials  and  Methods).  The  624-bp  fragments  were  digested  with  ftvFI,  which 
produced  a  231-bp  fragment  (a  mixture  of  Atel-1  and  Atel-2),  a  284-bp,  Atel- 
/ -specific  fragment,  and  a  31 7-bp,  Atel-2 -specific  fragment.  The  untreated  (lanes 
a)  and  Sc/FI -treated  (lanes  b)  samples  from  different  mouse  tissues  were  ana¬ 
lyzed  by  electrophoresis  in  a  2%  agarose  gel.  The  ratio  of  the  two  forms  of  Atel 
mRNA,  defined  as  the  ratio  of  the  284-bp  (Atel-1)  fragment  to  the  317-bp 
(Atel-2)  fragment,  was  determined  by  analyzing  serially  diluted  samples  and 
comparing  the  resulting  band  intensities  (data  not  shown),  sk.,  skeletal. 


significantly  similar  EST  sequence  of  D.  melanogaster  cDNA 
(accession  no.  AA391570).  Finally,  using  the  deduced  amino 
acid  sequence  of  the  Drosophila  EST  clone  as  a  probe,  we 
identified  a  467-bp  mouse  EST  sequence  (accession  no. 
AA415294)  that  exhibited  weak  but  significant  similarity  to  the 
Drosophila  sequence  but  no  detectable  similarity  to  S.  cerevi - 
siae  Atelp.  On  a  chance  that  this  467-bp  EST  had  been  derived 
from  the  mouse  Atel  cDNA,  we  used  it  to  screen  a  mouse 
cDNA  library  and  indeed  isolated  the  putative  mouse  Atel 
cDNAs  (Fig.  2A). 

Alternative  splicing  results  in  two  species  of  mouse  Atel 
cDNA  containing  distinct  but  homologous  exons.  During  the 
initial  mouse  cDNA  library  screening,  we  found  that  the  cDNA 
clone  3  (nt  699  to  1870  of  Atel-2  cDNA)  was  identical  to  the 
cDNA  clone  8  (nt  587  to  2099  of  Atel-1  cDNA),  except  for  a 
129-bp  region  whose  deduced  amino  acid  sequences  were  sim¬ 
ilar  (31%  identity;  61%  similarity)  (Fig.  2).  The  two  full-length 
Atel  cDNAs  (termed  Atel -1  and  Atel-2),  which  were  obtained 
by  RT-PCR  followed  by  5 '-RACE  (see  Materials  and  Meth¬ 
ods),  encoded  proteins  of  identical  length,  516  residues  (59.2 
kDa  and  pi  of  8.14  versus  59.1  kDa  and  pi  of  7.22)  (Fig.  2A), 
that  contained  regions  of  similarity  to  the  57.8-kDa  Atelp  of  S. 
cerevisiae  (Fig.  2B).  RT-PCR  (followed  by  subcloning)  with 
RNAs  from  different  mouse  tissues  also  produced  the  two 
forms  of  Atel  cDNAs,  indicating  that  the  two  species  were  in 
fact  present  in  the  initial  RNA  preparation  (Fig.  3  and  data  not 
shown). 

To  determine  whether  both  of  the  two  129-bp  regions  of  the 
Atel~l  and  Atel-2  cDNAs  were  a  part  of  the  Atel  gene,  and 
whether  Atel-1  and  Atel-2  were  produced  through  alternative 
splicing,  we  analyzed  the  mouse  Atel  gene  in  the  vicinity  of  its 
two  129-bp  exons,  using  at  first  PCR  and  subsequently  a  BAC 
clone  containing  Atel  (see  Materials  and  Methods).  The  two 
129-bp  exons  were  located  next  to  each  other  in  the  Atel  gene 
(Fig.  4A).  We  also  found  that  the  12-bp  sequences  around  the 
splice  acceptor  sites  of  these  exons  (6  bp  in  the  intron  and  6  bp 
in  the  exon)  were  identical  between  the  two  exons  (Fig.  4B), 
consistent  with  the  alternative  presence  of  these  exons  in  the 
mature  Atel  mRNA.  The  exon-containing  RT-PCR  products 
from  different  mouse  tissues  appeared  as  a  single  major  band 
retaining  one  of  the  two  129-bp  exons  (Fig.  3  and  data  not 
shown).  Subcloning  and  analyses  of  these  RT-PCR  products 
yielded  no  other  differentially  spliced  Atel  cDNAs  (for  exam¬ 
ple,  cDNAs  retaining  both  or  neither  of  the  two  129-bp  exons), 


FIG.  4.  The  two  forms  of  mouse  Atel  mRNA  are  produced  by  alternative 
splicing.  (A)  The  two  alternative  129-bp  exons  are  adjacent  in  the  mouse  Atel 
gene.  The  thick  line  denotes  genomic  DNA;  the  striped  and  black  rectangles 
denote  the  alternative  129-bp  exons,  M8  and  M3  (see  Materials  and  Methods); 
gray  rectangles  denote  the  flanking  Atel  exons,  of  unknown  sizes;  thin  lines 
denote  the  alternative  splicing  patterns  that  yield  the  two  forms  of  Atel  mRNA. 
(B)  The  underlined  12  bp  (6  bp  in  the  intron  and  6  bp  in  the  exon)  around  the 
splice  acceptor  sites  are  identical  between  the  two  alternative  129-bp  exons.  (C) 
Scale  diagrams  of  the  two  forms  of  mouse  Atel  cDNAs.  The  alternative  129-bp 
exons  M8  and  M3  arc  indicated  by  the  striped  and  black  boxes,  respectively. 


suggesting  that  the  splicing  of  Atel  pre-mRNA  is  tightly  regu¬ 
lated  to  retain  one  and  only  one  of  the  two  alternative  exons. 
Thus,  the  two  forms  of  Atel  mRNAs  are  produced  by  a  nearly 
unprecedented  (see  Discussion)  splicing  pathway  which  ulti¬ 
mately  yields  two  proteins  of  identical  size  that  bear  two  alter¬ 
native,  homologous  but  distinct  43-residue  internal  sequences 
(Fig.  4C). 

The  absence  of  alternative  129-bp  exons  from  the  plant  and 
fly  Atel  genes.  To  explore  the  evolution  of  Atel,  and  especially 
the  phylogeny  of  its  alternative  129-bp  exons,  we  cloned  the 
human,  plant  (A.  thaliana),  and  fly  ( D .  melanogaster)  ATE1 
cDNAs  (see  Materials  and  Methods).  Two  forms  of  the  human 
ATE1  cDNA,  termed  Hs-ATEl-1  and  Hs-ATEl-2 ,  were  iso¬ 
lated  from  human  293  cells  (the  forms’  molar  ratio  was  about 
1).  However,  only  one  form  of  the  Atel  cDNA  was  isolated 
from  either  the  leaves  of  A.  thaliana  (termed  At-Atel)  or  D. 
melanogaster  embryos  (termed  Dm-Atel),  suggesting  that  the 
alternative  129-bp  exons  may  not  be  present  in  the  Atel  genes 
of  plants  and  arthropods.  The  A .  thaliana  and  D.  melanogaster 
Atelp  proteins  were,  respectively,  629  and  All  residues  long 
(71  and  55  kDa,  with  pis  of  6.0  and  8.4).  Mouse  ATEl-lp  was 
82,  38,  and  42%  identical  (as  well  as  91,  57,  and  61%  similar) 
to  human  ATEl-lp,  A.  thaliana  Atelp,  and  D.  melanogaster 
Atelp,  respectively  (Fig.  2  and  data  not  shown).  A.  thaliana 
Atelp  bore  a  16-residue  region  containing  exclusively  Asp  or 
Glu  (data  not  shown). 

We  used  RT-PCR  and  RNA  preparations  from  A.  thaliana 
and  D.  melanogaster  to  amplify  the  relevant  regions  of  the 
corresponding  Atel  cDNAs.  The  resulting  fragments  were  di¬ 
gested  with  restriction  enzymes  that  recognize,  in  each  species, 
exclusively  the  region  that  corresponds  to  the  129-bp  exons  of 
the  mouse  Atel  cDNAs,  and  the  products  were  analyzed  by  gel 
electrophoresis.  The  initial  cDNA  fragments  of  A  thaliana  and 
D.  melanogaster  Atel  completely  disappeared  after  this  treat¬ 
ment,  in  contrast  to  the  homologous  mouse  cDNA  fragment 
(which  contained  two  distinct  sequences  of  identical  length), 
suggesting  that  the  two  alternative  exons  were  absent  from  the 
Atel  genes  of  plants  and  arthropods  (Fig.  5A). 

While  this  analysis  was  under  way,  complete  sequences  of 
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FIG.  5.  Alternative  splicing  of  Atel  pre-mRNA  in  mammals  (the  mouse)  but  in  neither  plants  (A.  thaliana)  nor  arthropods  {D,  melanogaster).  (A)  The  relevant  Atel 
cDNA  fragments  from  mouse  (794  bp).  A.  thaliana  (626  bp),  and  D.  melanogaster  (856  bp)  were  produced  by  RT-PCR  (see  Materials  and  Methods).  The  products  were 
treated  with  the  indicated  restriction  endonucleases  that  cut  exclusively  within  the  two  alternative  129-bp  exons  of  mouse  Atel  cDNAs  (RspMI  for  Atel-2;  SerAI  for 
Atel-1)  or  within  the  corresponding  regions  of  A.  thaliana  ( Pvull  and  ScrFI)  and  D.  melanogaster  (Acc I  and  EcoRV)  Atel  cDNAs.  (B)  The  A.  thaliana  and  D. 
melanogaster  Atel  genes.  The  exon-intron  organization  of  these  genes  was  deduced  through  comparisons  of  their  cDNA  sequences,  determined  in  this  work  (see 
Materials  and  Methods),  with  the  concurrently  determined  sequences  of  the  corresponding  genomic  DNA  regions  (see  text).  The  horizontal  lines  and  rectangles  denote, 
respectively,  introns  and  exons,  whose  lengths  are  indicated  below  and  above  the  line  denoting  introns.  Thick  horizontal  lines  indicate  the  regions  of  A.  thaliana  and 
D.  melanogaster  cDNAs  that  correspond  to  the  alternative  129-bp  exons  of  the  mouse  and  human  Atel  cDNAs  (Fig.  2  and  4).  The  lengths  of  the  A.  thaliana  and  D. 
melanogaster  Atel  genes  are,  respectively,  —3  and  ~2.5  kb. 


the  A .  thaliana  and  D.  melanogaster  Atel  loci,  determined 
through  the  corresponding  sequencing  projects,  were  depos¬ 
ited  in  GenBank  (accession  no.  AA005237  and  accession  no. 
AC004321,  respectively).  By  comparing  the  cloned  Atel  cDNAs 
(see  Materials  and  Methods)  and  the  corresponding  genomic 
sequences  of  A.  thaliana  and  D.  melanogaster ,  we  could  deduce 
the  organization  of  these  Atel  genes.  The  results  (Fig.  5B) 
directly  confirmed  the  absence  of  the  alternative  homologous 
exons  horn  Atel  of  A.  thaliana  and  D.  melanogaster ,  in  contrast 
to  mammalian  Atel.  The  corresponding  region  of  plant  Atelp 
is  more  similar  to  the  exon-encoded  sequence  of  mouse  ATE1- 
lp,  whereas  in  Drosophila  this  region  is  more  similar  to  the 
alternative  sequence  of  ATEl-2p  (Fig.  2C).  The  corresponding 
regions  of  S.  cerevisiae  and  Schizosaccharomyces  pombe  Atelp 
are  not  preferentially  similar  to  either  of  the  two  alternative 
exon-encoding  sequences  of  mouse  ATElp  (Fig.  2C). 

Mouse  ATEl-lp  and  ATEl-2p  can  implement  the  Asp/Glu- 
specific  subset  of  the  N-end  rule  pathway  but  differ  in  activity. 
To  determine  whether  the  two  putative  mouse  R-transferases 
are  in  fact  R-transferases  and  to  compare  their  activities  in  an 
in  vivo  setting,  we  examined  whether  ATEl-lp  and  ATEl-2p 
could  confer  metabolic  instability  on  Asp-|3gal  and  Glu-|3gal  in 
atel  A  S.  cerevisiae.  Asp  and  Glu  are  secondary  destabilizing 
residues  in  the  N-end  rule  (Fig.  1  and  introduction).  The  test 
substrates  Asp-pgal  and  Glu-pgal  (produced  through  cotrans- 
lational  deubiquitylation  of  Ub-Asp-pgal  and  Ub-Glu-pgal 
(4))  are  short-lived  in  wild-type  yeast  (half-lives  of  ~3  and  —30 
min,  respectively)  but  long-lived  (half-life  of  >20  h)  in  atel  A  S . 
cerevisiae  that  lacks  the  ATE1 -encoded  yeast  R-transferase  (5, 
7).  Previous  work  (19,  33)  has  shown  that  the  steady-state  level 
of  an  X-pgal  protein  is  a  sensitive  measure  of  its  metabolic 
stability. 

S.  cerevisiae  atel  IS  cells  were  co transformed  with  a  pair  of 


plasmids  that  expressed  one  of  the  two  putative  mouse  R- 
transferases,  ATEl-lp  or  ATEl-2p,  and  one  of  several  test 
substrates  (as  the  corresponding  Ub  fusions):  Asp-pgal,  Glu- 
pgal,  Arg-pgal,  Cys-pgal,  or  Met-pgal.  Met  and  Cys  are  stabi¬ 
lizing  residues  in  the  yeast  N-end  rule;  Arg  is  a  primary  desta¬ 
bilizing  residue;  Asp  and  Glu  are  secondary  destabilizing 
residues  (5,  56).  Control  tests  included  either  the  vector  alone 
or  a  plasmid  expressing  S.  cerevisiae  Atelp.  The  steady-state 
levels  of  X-pgal  proteins  were  determined  by  measuring  the 
enzymatic  activity  of  pgal  in  yeast  extracts.  Using  this  assay,  we 
found  that  both  forms  of  mouse  ATElp  were  able  to  confer 
metabolic  instability  on  either  Asp-Pgal  or  Glu-pgal  in  atel  IS  S. 
cerevisiae  (Fig.  6A).  ATEl-lp  and  ATEl-2p  destabilized  Glu- 
pgal  much  less  than  Asp-pgal  (Fig.  6A),  consistent  with  Glu 
being  a  less  destabilizing  residue  in  the  N-end  rule  than  Asp, 
presumably  because  of  less  efficient  arginylation  of  the  N- 
terminal  Glu  by  R-transferases  (54).  However,  while  the  ap¬ 
parent  destabilizing  activity  of  the  mouse  ATEl-lp  R-trans¬ 
ferase  was  only  slightly  lower  than  that  of  S.  cerevisiae  Atelp 
(expressed  from  the  identical  vector  and  promoter),  the  activ¬ 
ity  of  mouse  ATEl-2p  was  significantly  lower  than  that  of 
ATEl-lp  (Fig.  6A). 

We  also  asked  whether  the  two  forms  of  mouse  ATElp 
could  influence  each  other’s  activity  if  they  were  coexpressed  in 
the  same  cell  (such  an  influence  might  be  expected,  for  in¬ 
stance,  if  the  active  form  of  R-transferase  were  a  dimer  or  if 
the  two  forms  of  R-transferase  competed  for  binding  to  the 
same  component  of  a  pathway).  S.  cerevisiae  atel  IS  cells  were 
cotransformed  with  two  plasmids  bearing  different  selectable 
markers  and  expressing  different  combinations  of  ATEl-lp 
and  ATEl-2p  (1  +  1,  2+2,  or  1+2),  and  also  with  a  plasmid 
expressing  one  of  the  X-pgal  test  proteins  (X  =  Met,  Arg,  Cys, 
Asp,  or  Glu).  Control  cells  were  co  transformed  with  the  two 
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FIG  6  The  two  forms  of  mouse  ATElp  can  implement  the  Asp/Glu-specific  subset  of  the  N-end  rule  pathway.  (A)  Relative  enzymatic  activities  of  (Jgal  in  ateJ  A 
S.  i^rei'i.rf(ieTransformed  with  plasmids  expressing  X-f3gal  (as  Ub-X-jjgal)  test  proteins  (X  =  Met,  Arg,  Cys,  Asp,  or  Glu) 

ATEJ  (denoted  as  y),  mouse  ATEl-lp  (denoted  as  1),  mouse  ATEl-2p  (denoted  as  2),  or  the  vector  alone  (denoted  as  v).  The  N-termmal  residues  of  A-Pgals  in  ^ 
set  of  experiments  are  indicated  at  the  top.  The  activity  of  Met+gal  in  cells  transformed  with  vector  alone  is  taken  as  100%.  (B)  The  two  forms  of  mouse  ATElp  exhibit 
no  cooperativity  in  mediating  the  degradation  of  X+gals.  Shown  are  relative  enzymatic  activities  of  X-Pgals  m  ateJ  A  5.  cerevisiae  strain  cotransformed  with  plasmids 
expressing  X-Bgals  (Ub-X-Bgals)  (X  =  Met,  Arg,  Cys,  Asp,  or  Glu)  and  the  combinations  of  plasmids  expressing  the  following  proteins,  mouse  ATEl-lp  and  ATE1  lp 
(1+1)  ATEl-lp  and  ATEl-2p  (1+2),  ATEl-2p  and  ATEl-2p  (2+2),  or  two  vector  controls  (v+v).  One  of  the  two  vectors  bore  the  TRP1  marker  and  the  other  bore 
the  HIS3  marker  (see  Materials  and  Methods).  Results  are  averages  of  four  independent  measurements,  which  differed  by  less  than  10 +. 


vectors  alone.  The  results  (Fig.  6B)  indicated  that  the  total 
activities  of  the  1  +  1  and  1+2  combinations  (measured  as  the 
extent  of  destabilization  of  Asp-pgal  or  Glu-Pgal)  were  similar 
to  each  other  and  much  higher  than  the  total  activities  of  2+2 
(Fig.  6B),  consistent  with  the  conjecture  that  the  two  forms  of 
mouse  R-transferase  do  not  interact  and  that  ATEl-lp  is  a 
more  (possibly  much  more)  active  enzyme  than  ATEl-2p. 

Neither  ATEl-lp  nor  ATEl-2p  confers  metabolic  instability 
on  Cys-pgal.  Cysteine  is  a  stabilizing  residue  in  the  yeast  N-end 
rule  but  a  secondary  destabilizing  residue  in  multicellular  or¬ 
ganisms  such  as  mammals  and  amphibians  (14, 18,  30,  54).  The 
presence  of  two  alternative  regions  in  the  two  forms  of  mouse 
R-transferase  (Fig.  4C)  initially  suggested  that  one  R-trans- 
ferase  might  be  specific  for  N-terminal  Asp  and  Glu,  with  the 
other  specific  for  Cys.  However,  Cys-pgal,  which  is  long-lived 
in  wild-type  S.  cerevisiae  (5),  remained  long-lived  in  the  pres¬ 
ence  of  either  ATEl-lp  or  ATEl-2p  (Fig.  6A).  This,  finding 
and,  more  directly,  the  results  of  amino  acid  sequencing  (see 
below)  suggest  the  existence  of  a  mammalian  tRNA-depen- 
dent  enzyme  (presumably  a  distinct  R-transferase)  (18)  that 
mediates  destabilizing  activity  of  N-terminal  Cys. 


Mouse  ATEl-lp  and  ATEl-2p  destabilize  Asp-pgal  and 
Glu-pgal  through  arginylation  of  their  N-terminal  residues. 

To  verify  directly  that  mouse  ATEl-lp  and  ATEl-2p  in  fact 
possess  the  R-transferase  activity,  we  constructed  the  atelEs 
ubrlts  S.  cerevisiae  double  mutant  AVY34,  which  lacked  both 
R-transferase  and  N-recognin  (E3),  the  main  recognition  com¬ 
ponent  of  the  N-end  rule  pathway  (see  Materials  and  Meth¬ 
ods).  Consequently,  N-terminal  arginylation  of  a  test  protein  in 
this  mutant  by  an  exogenous  R-transferase  would  not  result  in 
degradation  of  the  protein,  thereby  making  it  possible  to  iso¬ 
late  enough  of  the  test  protein  for  N-terminal  sequencing. 
Strain  AVY34  was  transformed  with  pUB23-D  (expressing 
Ub-Asp-pgal)  and  also  with  either  pATl  (expressing  ATEl- 
lp),  pAT2  (expressing  ATEl-2p),  or  vector  alone  and  was 
grown  in  SG  medium.  Asp-Pgal  proteins  isolated  from  these 
transformants  were  subjected  to  N-terminal  sequencing  (see 
Materials  and  Methods).  The  results  (Table  1)  directly  con¬ 
firmed  that  both  ATEl-lp  and  ATEl-2p  possessed  R-trans- 
ferase  activity.  In  agreement  with  the  finding  that  ATEl-lp 
was  more  active  than  ATEl-2p  in  destabilizing  Asp-Pgal  in 
vivo  (Fig.  6A),  Asp-Pgal  from  cells  expressing  ATEl-lp  was 
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TABLE  1.  N-terminal  sequencing  of  X-J3gal  proteins  isolated  from 
atelA  ubrl  A  S.  cerevisiae  expressing  different  R-transferases 


Substrate 

Coexpressed 

protein 

N-terminal 

sequence 

Yield  (%) 

D-eK-pgal 

d-h-g-s-a- 

D-eK-3gal 

Vector  alone 

D-H-G-S-A- 

-100 

D-eK-3gal 

Mouse  ATEl-lp 

R-D-H-G-S-A- 

-100 

D-eK-pgal 

Mouse  ATEl-2p 

R-D-H-G-S-A- 

-50 

D-H-G-S-A- 

-50 

C-eK-3gal 

C-H-G-S-A- 

C-eK-pgal 

Vector  alone 

C-H-G-S-A- 

-20 

C-eK-3gal 

Mouse  ATEl-lp 

C-H-G-S-A- 

-20 

C-eK-(3gal 

Mouse  ATEl-2p 

C-H-G-S-A- 

-20 

found  to  be  completely  arginylated,  whereas  Asp-|8gal  from 
cells  expressing  ATEl-2p  was  arginylated  to  approximately 
50%  (Table  1). 

We  also  determined,  using  the  above  procedure,  whether 
mouse  ATEl-lp  or  ATEl-2p  could  arginylate  N-terminal  Cys. 
Approximately  80%  of  Cys-(3gal  isolated  from  atelA  ubrl  A  S. 
cerevisiae  was  found  to  be  N-terminally  blocked,  presumably 
acetylated  (Table  1).  However,  the  rest  of  Cys-pgal  (—20%) 
bore  the  N-terminal  sequence  beginning  with  Cys  and  lacking 
N-terminal  Arg,  in  agreement  with  the  results  of  the  in  vivo 
Cys-3gal  degradation  assays  (Fig.  6  and  Table  1).  Thus,  both 
ATEl-lp  and  Atel-2p  are  apparently  unable  to  utilize  N- 
terminal  Cys  as  a  substrate  in  5.  cerevisiae. 

ATEl-2p  is  exclusively  cytosolic,  whereas  ATEl-lp  is 
present  in  either  the  nucleus  or  the  cytosol.  To  determine  the 
intracellular  location  of  the  two  forms  of  mouse  R-transferase, 


we  constructed  fusions  to  the  N  terminus  of  GFP  and  tran¬ 
siently  expressed  them  in  NIH  3T3  cells.  Whereas  the  free 
26-kDa  GFP  was  located  in  both  the  nucleus  and  the  cytosol 
(data  not  shown),  the  85-kDa  Atel-2p-GFP  fusion  was  exclu¬ 
sively  cytosolic  in  all  of  the  many  transfected  cells  examined 
(Fig.  7a  to  c).  In  contrast,  the  85-kDa  ATEl-lp-GFP  (the 
alternative  form  of  R-transferase  that  is  much  more  active 
enzymatically  than  ATEl-2p)  was  found  to  be  localized  differ¬ 
ently  in  different  cells  on  the  same  coverslip,  possibly  depend¬ 
ing  on  their  cell  cycle  position  and/or  metabolic  state.  Specif¬ 
ically,  in  —50%  of  the  transfected  cells,  ATEl-lp-GFP  was 
exclusively  cytosolic  (Fig.  7d  and  e),  as  was  ATEl-2p-GFP 
(Fig.  7a  to  c),  but  in  the  other  -50%  of  cells,  ATEl-lp-GFP 
was  present  in  the  nucleus  as  well  and,  moreover,  appeared  to 
be  significantly  enriched  in  the  nucleus  (Fig.  If  and  g).  Thus, 
the  two  43-residue  alternative  regions  in  ATEl-lp  and 
ATEl-2p  confer  overlapping  but  nonidentical  intracellular  dis¬ 
tributions  on  the  respective  R-transferases. 

While  the  nonuniformity  of  the  ATEl-lp-GFP  localization 
among  mouse  cells  in  a  single  culture  remains  to  be  under¬ 
stood,  its  preferential  location  in  the  nuclei  of  some  cells  is 
consistent  with  a  high  content  of  basic  residues  in  its  43-residue 
region,  in  comparison  to  the  alternative  homologous  region  of 
ATEl-2p  (Fig.  2C).  (No  sequences  fitting  the  consensus  se¬ 
quences  of  known  nuclear  localization  signals  could  be  de¬ 
tected  in  the  43-residue  region  of  ATEl-lp).  In  contrast  to 
mouse  R-transferases,  S.  cerevisiae  Atelp  was  shown  to  be 
located  predominantly  in  the  nuclei  of  yeast  cells  (56a). 

The  ratio  of  Ate  1-1  to  Ate  1-2  mRNA  varies  greatly  among 
mouse  tissues.  Northern  hybridization,  using  the  1.1-kb  mouse 
Atel  cDNA  fragment  (nt  638  to  1734)  as  a  probe,  detected  a 


FIG.  7.  Intracellular  localization  of  mouse  ATEl-lp  and  ATEl-2p.  Shown  are  green  (GFP)  fluorescence  (a  to  d  and  f)  and  phase-contrast  (e  and  g)  micrographs 
of  mouse  NIH  3T3  cells  transiently  transfected  with  ATEl-lp-GFP  (d  to  g)  or  ATEl-2p-GFP  (a  to  c)  fusion  proteins  (see  Materials  and  Methods).  Panels  a  to  c  show 
different  examples  of  the  exclusively  cytosolic  localization  of  ATEl-2p.  Regions  around  the  nucleus  and  in  the  lamellar  protrusions  at  the  edges  of  a  cell  (c)  exhibit 
higher  GFP  fluorescence,  possibly  because  of  a  greater  thickness  of  cells  in  these  areas.  Panels  d  plus  e  and  f  plus  g  show  pairs  of  GFP  fluorescence  and  phase-contrast 
pictures  of  cells  that  express  ATEl-lp-GFP.  The  cell  in  panels  d  and  e  shows  ATEl-lp-GFP  in  the  cytosol  but  not  in  the  nucleus.  Cells  in  panels  f  and  g  contain 
ATEl-lp-GFP  in  both  the  cytosol  and  the  nucleus,  the  latter  being  apparently  enriched  in  ATEl-lp-GFP. 
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FIG.  8.  Northern  hybridization  analyses  of  mouse  Atel  mRNA.  The  North¬ 
ern  blots  of  mRNA  from  different  mouse  tissues  were  probed  with  an  Atel  cDNA 
fragment  (nt  638  to  1734)  which  can  hybridize  to  both  forms  of  Atel  mRNA.  A 
mouse  (3-actin  cDNA  probe  was  used  for  comparing  the  total  RNA  loads  as 
described  previously  (19).  The  same  blot  was  also  hybridized  with  mouse  Ubrl 
(29)  (sec  the  introduction).  The  apparent  absence  of  Atel  and  Ubrl  mRNAs 
from  the  spleen  is  an  artifact  of  RNA  degradation  in  this  lane  of  the  blot  (data 
not  shown).  Also  shown  are  the  results  of  analogous  Northern  hybridizations  of 
the  mouse  Ntanl  cDNA,  encoding  the  Asn-specific  Nt-amidasc  (Fig.  1A),  and 
E2J4K,  encoding  the  relevant  Ub-conjugating  (E2)  enzyme  (19).  The  approxi¬ 
mate  sizes  of  transcripts  are  indicated  on  the  right. 


single  ~5.0-kb  transcript  (a  mixture  of  the  Atel-1  and  Atel-2 
mRNAs)  in  all  of  the  mouse  tissues  examined  except  the  testis, 
where  the  ~5-kb  Atel  mRNA  was  a  minor  one,  the  major 
species  being  ~2  kb  (Fig.  8).  Both  ATElp  and  the  other 
targeting  components  of  the  mammalian  N-end  rule  pathway 
are  expressed  ubiquitously  (at  various  levels),  and  the  testis- 
specific  patterns  of  transcripts  are  characteristic  for  all  of  them 
as  well  (Fig.  8).  The  existence  of  the  Y-chromosome-encoded, 
testis-specific  variant  of  the  Ub-activating  (El)  enzyme  (27, 35) 
suggests  that  the  testis-specific  modifications  of  the  N-end  rule 
pathway  may  be  functionally  relevant  in  spermatogenesis. 

To  determine  the  ratio  of  Atel-1  to  Atel-2  mRNA  in  differ¬ 
ent  mouse  tissues  or  cells  in  culture,  we  employed  RT-PCR, 
using  sequence  differences  between  the  two  alternative,  ho¬ 
mologous  129-bp  exons  to  distinguish  between  them  (see  Ma¬ 
terials  and  Methods)  (Fig.  3).  Approximately  equal  amounts  of 
Atel-1  and  Atel-2  mRNAs  were  present  in  mouse  embryonic 
fibroblasts  and  in  human  293  cells  in  culture  (Fig.  3  and  data 
not  shown).  However,  the  molar  ratio  of  Atel-1  to  Atel-2 
mRNA  was  found  to  vary  greatly  among  the  mouse  tissues:  it 
was  —0.1  in  the  skeletal  muscle,  —0.25  in  the  spleen,  —3.3  in 
the  liver,  and  brain,  and  —10  in  the  testis  (Fig.  3).  Thus,  while 
the  total  expression  of  Atel  ( Atel-1  plus  Atel-2)  varies  by  2-  to 
4-fold  among  mouse  tissues  (Fig.  8),  the  difference  in  expres¬ 
sion  levels  between  Atel-1  and  Atel-2  mRNAs  can  be  as  high 
as  a  100-fold  (the  skeletal  muscle  versus  the  testis)  (Fig.  3), 
suggesting  that  the  two  forms  of  R-transferase  may  be  func¬ 
tionally  distinct. 


DISCUSSION 

The  N-end  rule  pathway  is  one  of  several  proteolytic  path¬ 
ways  of  the  Ub  system  (23,  54,  55).  Among  the  targets  of  the 
N-end  rule  pathway  are  proteins  that  bear  destabilizing  N- 
terminal  residues.  In  the  yeast  S.  cerevisiae ,  Asn  and  Gin  are 
tertiary  destabilizing  N-terminal  residues  in  that  they  function 
through  their  conversion,  by  a  specific  amidase  (6),  to  the 
secondary  destabilizing  N-terminal  residues  Asp  and  Glu.  The 
destabilizing  activity  of  N-terminal  Asp  and  Glu  requires  their 
conjugation,  by  the  ATEL -encoded  R-transferase,  to  Arg,  one 
of  the  primary  destabilizing  residues  (7)  (Fig.  IB).  In  mam¬ 
mals,  the  set  of  secondary  destabilizing  residues  contains  not 
only  Asp  and  Glu  but  also  Cys,  which  is  a  stabilizing  residue  in 
yeast  (18,  54)  (Fig.  1). 

In  this  work,  we  isolated  cDNA  encoding  the  mouse  R- 
transferase,  ATElp,  and  found  that  this  enzyme  exists  in  two 
forms,  termed  ATEl-lp  and  ATEl-2p,  which  differ  by  con¬ 
taining  one  of  the  two  alternative,  homologous  43-residue  re¬ 
gions.  The  two  516-residue  R-transferases  are  produced  from 
the  mouse  Atel  gene  by  a  pathway  of  alternative  splicing  that 
retains  one  or  the  other  of  the  two  homologous  129-bp  exons. 
The  presence  of  two  adjacent,  homologous,  equal-length,  and 
alternatively  utilized  exons  in  a  gene  (Fig.  4)  is  nearly  unprec¬ 
edented.  To  our  knowledge,  just  one  such  case  was  described 
previously:  the  mouse  kE2  enhancer-binding  protein  E12/E47 
(37).  The  two  KE2-binding  proteins,  E12  and  E47,  are  pro¬ 
duced  through  a  switch  between  two  alternative,  equal-length 
exons,  resulting  in  two  helix-loop-helix  DNA-binding  proteins 
that  differ  in  the  ability  to  homodimerize.  Specifically,  E47  can 
bind  to  the  kE2  enhancer  either  as  a  homodimer  or  as  a 
heterodimer  with  MyoD,  whereas  E12  can  bind  as  a  het¬ 
erodimer  with  MyoD  but  not  as  a  homodimer  (37). 

We  report  the  following  major  findings. 

(i)  Identification,  through  species  walking,  and  isolation  of 
the  mouse  cDNA  encoding  R-transferase  (or  ATElp)  have 
shown  that  mammalian  ATElp  exists  in  two  forms,  ATEl-lp 
and  ATEl-2p,  which  differ  exclusively  by  one  of  the  two  alter¬ 
native,  homologous  43-residue  regions  (Fig.  2A). 

(ii)  The  corresponding  alternative  129-bp  exons  are  adjacent 
in  the  mouse  Atel  gene.  Moreover,  the  12-bp  sequences 
around  the  splice  acceptor  sites  of  these  exons  (6  bp  in  the 
intron  and  6  bp  in  the  exon)  are  identical  between  the  two 
exons  (Fig.  4).  The  splicing  of  Atel  pre-mRNA  proceeds  in 
such  way  that  one,  and  only  one,  of  the  alternative  129-bp 
exons  is  always  retained  in  the  mature  Atel  mRNA. 

(iii)  The  human  ATE1  gene  also  contains  the  two  alternative 
129-bp  exons,  whereas  the  plant  (A.  thaliana)  and  fly  ( D .  mela- 
nogaster)  Atel  genes  encode  a  single  form  of  ATElp  (Fig.  2 
and  5).  The  corresponding  43-residue  regions  are  significantly 
similar  among  all  of  the  sequenced  R-transferases,  from  S. 
cerevisiae  to  mammals  (Fig.  2C).  The  set  of  Atel  genes  from 
mammals  to  yeast  defines  a  distinct  family  of  proteins,  the 
ATE  family.  The  splicing-derived  alternative  forms  of  R-trans¬ 
ferase  have  evolved  apparently  after  the  divergence  of  the 
arthropod  and  vertebrate  lineages. 

(iv)  Expression  of  the  mouse  Atel-1  and  Atel-2  cDNAs  in 
atelEs  S .  cerevisiae ,  and  N-terminal  sequencing  of  isolated 
X-pgal  test  proteins,  was  used  to  show  that  ATEl-lp  and 
ATEl-2p  could  implement  the  Asp/Glu-specific  subset  of  the 
N-end  rule  pathway  and  that  they  did  so  through  the  argin- 
ylation  of  N-terminal  Asp  or  Glu  in  the  test  substrates  (Fig.  6A 
and  Table  1). 

(v)  While  the  destabilizing  activity  of  the  mouse  ATEl-lp 
R-transferase  is  only  slightly  lower  than  that  of  S.  cerevisiae 
R-transferase,  the  activity  of  mouse  ATEl-2p  is  significantly 
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(possibly  considerably)  lower  than  that  of  ATEl-lp.  This  con¬ 
clusion  follows  also  from  a  comparison  of  the  N-terminal  ar- 
ginylation  of  Asp-pgal  by  the  two  R-transferases  (Table  1). 
The  results  of  coexpressing  mouse  ATEl-lp  and  ATEl-2p  in 
the  same  atel  A  yeast  cells  were  consistent  with  the  conjecture 
that  R-transferase  functions  as  a  monomer  (Fig.  6B). 

(vi)  Neither  ATEl-lp  nor  ATEl-2p  could  confer  instability 
on  (or  arginylate)  Cys-pgal  in  atel  A  S.  cerevisiae  (Fig.  6A  and 
Table  1).  Cys  is  a  stabilizing  residue  in  yeast  but  a  secondary 
destabilizing  residue  in  the  mammalian  N-end  rule  (54).  A 
distinct  Cys-specific  mammalian  R-transferase  suggested  by 
these  data  remains  to  be  identified. 

(vii)  Mouse  ATEl-2p  (tested  as  a  GFP  fusion)  was  exclu¬ 
sively  cytosolic  in  mouse  3T3  cells,  whereas  ATEl-lp  was  lo¬ 
calized  differentially  in  different  cells  of  the  same  (unsynchro¬ 
nized)  culture:  it  was  either  exclusively  cytosolic  or  present  in 
both  the  cytosol  and  the  nucleus  (Fig.  7). 

(viii)  Mouse  Atel  is  a  ubiquitously  expressed  gene.  A  single 
— 5-kb  mRNA  was  present  in  all  of  the  tissues  examined  except 
the  testis,  where  the  major  Atel  transcript  was  —2  kb  in  length 
(Fig.  8).  The  testis-specific  differential  expression  patterns  are 
also  characteristic  of  the  other  targeting  components  of  the 
mammalian  N-end  rule  pathway,  such  as  the  Ntanl -encoded 
Asn-specific  Nt-amidase  and  the  Ubrl -encoded  N-recognin 
(E3a)  (19,  29). 

(ix)  The  molar  ratio  of  Ate  1-1  to  Ate  1-2  mRNA  varies  up  to 
a  100-fold  among  different  mouse  tissues  (Fig.  3  and  Results), 
suggesting  a  functional  significance  of  the  difference  between 
the  two  R-transferases. 

The  region  of  ATElp  that  corresponds  to  the  two  129-bp 
mammalian  Atel  exons  has  been  significantly  conserved 
throughout  eukaryotic  evolution,  Tyr-296,  Gln-297,  and  His- 
301  of  the  mouse  ATEl-lp  being  among  the  most  highly  con¬ 
served  residues  (Fig.  2C).  No  putative  members  of  the  ATE 
family  could  be  detected  among  the  currently  known  prokary¬ 
otic  ORFs.  The  most  highly  conserved  region  of  R-transferases 
is  an  82-residue  stretch  (residues  336  to  417)  of  mouse  ATElp: 
this  region  is  95,  76,  and  63%  identical  to  the  corresponding 
regions  of  the  human,  D.  melanogaster ,  and  A.  thaliana  ATElp, 
respectively  (Fig.  2B).  A  Cys  residue(s)  is  likely  to  be  a  com¬ 
ponent  of  the  active  site  of  R-transferase  (31,  32).  Among  the 
five  fully  conserved  Cys  residues  in  proteins  of  the  ATE  family, 
four  are  located  in  the  56-residue  N-terminal  region  (residues 
23  to  78  of  mouse  ATElp)  (Fig.  2B).  Conversion  of  some  of 
these  cysteines  in  S .  cerevisiae  Atelp  to  alanines  was  found  to 
decrease  greatly  the  R-transferase  activity  of  yeast  Atelp  (16a, 
32).  Furthermore,  derivatives  of  mouse  ATEl-lp  and  ATEl-2p 
that  lacked  the  first  42  residues  were  completely  inactive  in  the 
yeast-based  Asp-pgal  degradation  assay  of  a  kind  described  in 
Fig.  7  (data  not  shown).  Finally,  a  90-residue  C-terminal  trun¬ 
cation  of  S,  cerevisiae  Atelp  did  not  result  in  a  major  decrease 
of  its  R-transferase  activity  (16a).  Thus,  the  active  site  of  R- 
transferase  is  likely  to  encompass  at  least  some  of  the  above 
N-terminal  cysteines. 

Since  the  two  mammalian  R-transferases  (Fig.  4C)  are  iden¬ 
tical  in  size  and,  except  for  a  43-residue  region,  are  identical 
otherwise  as  well,  it  is  likely  that  the  previously  described 
(partially  purified)  mammalian  R-transferases  (12,  47)  were  in 
fact  mixtures  of  Atel-lp  and  Atel-2p.  On  the  other  hand, 
fractionation  of  a  crude  R-transferase  preparation  from  rabbit 
reticulocytes  did  yield,  in  addition  to  a  major  fraction  of  R- 
transferase,  chromatographically  distinct  R-transferase  frac¬ 
tions  that  were  not  investigated  further  (12).  The  ratio  of 
Atel-lp  to  Atel-2p  in  rabbit  reticulocytes  is  currently  un¬ 
known. 
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exon  of  ATEl-lp  with  the  alternative  129-bp  exon  results  in  a 
protein,  ATEl-2p,  that  has  a  significantly  (possibly  consider¬ 
ably)  lower  R-transferase  activity  (Fig.  6).  In  addition, 
ATEl-2p  is  unable  to  enter  the  nucleus  (as  a  GFP  fusion),  in 
contrast  to  ATEl-lp  (Fig.  7).  Taken  together  with  the  finding 
that  the  expression  ratio  of  the  two  Atel  mRNAs,  Atel-1  and 
Ate  1-2,  varies  up  to  a  100-fold  among  different  mouse  tissues 
(Fig.  3  and  Results),  these  data  suggest  that  the  two  R-trans¬ 
ferases  are  functionally  distinct  as  well.  Cited  below  are  some 
of  the  possibilities  that  are  consistent  with  the  available  evi¬ 
dence. 

Mouse  ATEl-2p  has  the  R-transferase  activity  but  arginyl- 
ates,  at  steady  state,  only  —50%  of  Asp-Pgal  in  atel  A  S.  cer¬ 
evisiae ,  in  contrast  to  both  ATEl-2p  and  S.  cerevisiae  Atelp 
(Fig.  6  and  Table  1).  Moreover,  the  inefficient  arginylation  by 
ATEl-2p  occurs  in  spite  of  its  overexpression  in  S.  cerevisiae. 
In  contrast,  the  yeast  Atelp,  which  in  wild- type  S.  cerevisiae  is 
a  weakly  expressed  protein  (7),  can  quantitatively  arginylate  in 
vivo  an  overexpressed  substrate  such  as  Asp-pgal  (18).  Thus,  at 
a  low  level  of  expression  (which  is  likely  to  be  the  case  in  the 
mouse),  the  ATEl-2p  R-transferase  may  be,  in  effect,  an  in¬ 
active  enzyme,  in  contrast  to  ATEl-lp.  If  so,  ATEl-2p  might 
act  as  an  (indirect)  inhibitor  of  the  ATEl-lp  function,  for 
example,  through  a  competition  with  ATEl-lp  for  the  binding 
to  a  component  of  the  targeting  complex  in  the  N-end  rule 
pathway.  (The  apparent  absence  of  such  competition  in  S. 
cerevisiae  [Fig.  6B]  may  result  from  the  lack  of  binding  by 
ATElp  to  heterologous  yeast  proteins.)  It  is  also  possible  that 
a  large  difference  in  activity  between  mouse  ATEl-lp  and 
ATEl-2p  in  yeast  reflects  not  their  different  enzymatic  activi¬ 
ties  in  the  mouse  but  a  (physiologically  irrelevant)  differential 
recognition  of  an  essential  yeast  cofactor  such  as  Arg-tRNA. 
Direct  comparisons  of  arginylation  kinetics  by  the  purified 
mouse  and  yeast  R-transferases  will  be  required  to  address  this 
unlikely  but  unexcluded  interpretation. 

Another  possibility  is  that  ATEl-2p  has  a  distinct  enzymatic 
activity  that  has  been  missed  by  the  current  N-terminal  ar¬ 
ginylation  assay  (Fig.  6  and  Table  1).  For  example,  ATEl-2p 
might  be  able  to  arginylate  an  internal  residue  in  a  substrate 
protein.  In  vitro  enzymological  dissection  of  ATEl-lp  and 
ATEl-2p  will  address  this  and  related  conjectures.  Yet  an¬ 
other  possibility  is  that  the  alternative  43-residue  regions  of 
ATEl-lp  and  ATEl-2p  confer  different  metabolic  stabilities 
on  the  two  R-transferases,  the  lower  apparent  activity  of 
ATEl-2p  in  yeast  being  due  at  least  in  part  to  its  shorter 
half-life.  A  test  of  this  model  in  mouse  cells  requires  antibodies 
specific  for  the  alternative  regions  of  the  two  R-transferases; 
preparation  of  such  antibodies  is  under  way. 

In  yeast,  the  N-end  rule  pathway  is  present  in  both  the 
cytosol  and  the  nucleus.  The  apparent  exclusion  of  mouse 
ATEl-2p  from  the  nucleus  and  the  different  ratios  of  Atel-1  to 
Atel -2  mRNA  among  the  mouse  tissues  suggest  that  the  rule 
book  of  the  N-end  rule  pathway  may  be  regulated  differentially 
in  the  cytosol  and  the  nucleus,  through  a  cell-type-specific 
expression  of  the  pathway’s  components  that  are  located  in  one 
but  not  the  other  compartment. 

Physiological  substrates  for  either  eukaryotic  R-transferases 
(54)  or  their  prokaryotic  counterparts,  L,  F- transferases  (1,  25, 
45)  are  not  known.  The  cloning  and  characterization  of  the  first 
mammalian  Atel  cDNAs  and  genes  (Fig.  2),  and  the  discov¬ 
ery  of  alternative  splicing  that  yields  mouse  ATEl-lp  and 
ATEl-2p  (Fig.  4)  should  facilitate  understanding  of  the  func¬ 
tions  of  mammalian  R-transferases,  in  part  through  the  anal¬ 
ysis  of  ATEl-lp  and  ATEl-2p  enzymes  and  also  because  it  is 
now  possible  to  construct  mouse  strains  that  lack  ATEl-lp 
and/or  ATEl-2p. 
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We  dissected  physical  and  functional  interactions 
between  the  ubiquitin-conjugating  (E2)  enzyme  Ubc2p 
and  Ubrlp,  the  E3  component  of  the  N-end  rule 
pathway  in  Saccharomyces  cerevisiae.  The  binding  of 
the  20  kDa  Ubc2p  by  the  225  kDa  Ubrlp  is  shown  to 
be  mediated  largely  by  the  basic  residue-rich  (BRR) 
region  of  Ubrlp.  However,  mutations  of  the  BRR 
domain  that  strongly  decrease  the  interaction  between 
Ubrlp  and  Ubc2p  do  not  prevent  the  degradation  of 
N-end  rule  substrates.  In  contrast,  this  degradation  is 
completely  dependent  on  the  RING-H2  finger  of  Ubrlp 
adjacent  to  the  BRR  domain.  Specifically,  the  first 
cysteine  of  RING-H2  is  required  for  the  ubiquitylation 
activity  of  the  Ubrlp-Ubc2p  complex,  although  this 
cysteine  plays  no  detectable  role  in  either  the  binding 
of  N-end  rule  substrates  by  Ubrlp  or  the  physical 
affinity  between  Ubrlp  and  Ubc2p.  These  results 
defined  the  topography  of  the  Ubc2p-Ubrlp  interaction 
and  revealed  the  essential  function  of  the  RING-H2 
finger,  a  domain  that  is  present  in  many  otherwise 
dissimilar  E3  proteins  of  the  ubiquitin  system. 
Keywords'.  E2/E3/N-end  rule/proteasome/RING  finger/ 
ubiquitin 


Introduction 

Ubiquitin  (Ub)  is  a  76  residue  protein  whose  covalent 
conjugation  to  other  proteins,  usually  in  the  form  of 
a  multi-Ub  chain,  marks  these  proteins  for  processive 
degradation  by  the  26S  proteasome,  an  ATP-dependent 
multisubunit  protease  (Hochstrasser,  1996;  Varshavsky, 
1997;  Baumeister  et  al. ,  1998;  Hershko  and  Ciechanover, 
1998;  Scheffner  et  al. ,  1998).  Through  either  constitutive 
or  conditional  degradation  of  many  intracellular  proteins, 
the  Ub-dependent  proteolytic  pathways  regulate  a  multi¬ 
tude  of  biological  processes,  including  the  cell  cycle,  cell 
differentiation,  apoptosis,  DNA  transcription,  replication 
and  repair,  signal  transduction,  functions  of  the  nervous 
system  and  stress  responses,  including  the  immune 
response  (Hicke,  1997;  Varshavsky,  1997;  Peters  et  al. , 
1998). 

The  conjugation  of  Ub  to  other  proteins  involves  the 
formation  of  a  thioester  between  the  C-terminus  of  Ub 
and  a  specific  cysteine  of  the  Ub-activating  (El)  enzyme. 
The  Ub  moiety  of  El  ~Ub  thioester  is  thereafter  transesteri- 
fied  to  a  specific  cysteine  in  one  of  several  Ub-conjugating 
(E2)  enzymes.  The  Ub  moiety  of  E2~Ub  thioester  is 


conjugated,  via  the  isopeptide  bond,  to  the  e-amino  group 
of  either  a  substrate’s  Lys  residue  or  a  Lys  residue  of 
another  Ub  moiety,  the  latter  reaction  resulting  in  a 
substrate-linked  multi-Ub  chain  (Chau  etal,  1989;  Pickart, 
1997;  Scheffner  et  al. ,  1998).  Most  E2  enzymes  function 
in  complexes  with  proteins  called  E3s.  The  functions  of 
E3s  include  the  initial  recognition  of  degradation  signals 
(degrons)  in  the  substrate  proteins,  with  different  E3s 
recognizing  different  classes  of  degrons.  At  least  some 
E3s,  specifically  those  containing  the  HECT  domain, 
accept  the  Ub  moiety  from  the  associated  E2~Ub  thioester, 
forming  an  E3~Ub  thioester  and  acting  as  a  proximal 
donor  of  the  Ub  moiety  to  substrates  selected  by  the  E3 
(Scheffner  et  al .,  1995;  Nuber  and  Scheffner,  1999;  Wang 
et  al.  1999). 

One  pathway  of  the  Ub  system  is  the  N-end  rule  pathway 
(Bachmair  et  al.,  1986;  Varshavsky,  1996).  Among  the 
targets  of  this  pathway  are  proteins  bearing  destabilizing 
N-terminal  residues.  In  the  yeast  Saccharomyces  cerevis¬ 
iae ,  Asn  and  Gin  are  tertiary  destabilizing  N-terminal 
residues,  in  that  they  function  through  their  conversion,  by 
the  V7A 7 -encoded  N-terminal  amidase,  into  the  secondary 
destabilizing  N-terminal  residues  Asp  and  Glu.  The  desta¬ 
bilizing  activity  of  N-terminal  Asp  and  Glu  requires  their 
conjugation,  by  the  ATE7-encoded  Arg-tRNA-protein 
transferase,  to  Arg,  one  of  the  primary  destabilizing 
residues  (Balzi  et  al.,  1990;  Baker  and  Varshavsky,  1995). 
In  mammals,  the  deamidation  step  is  bifurcated,  in  that 
two  distinct  amidases  specific,  respectively,  for  N-terminal 
Asn  or  Gin,  mediate  the  activity  of  tertiary  destabilizing 
residues  (Stewart  et  al.,  1995;  Grigoryev  et  al,  1996). 
The  primary  destabilizing  N-terminal  residues  are  bound 
directly  by  the  UBR 1 -encoded  E3  (N-recognin),  the  recog¬ 
nition  component  of  the  N-end  rule  pathway.  In  S. cerevis¬ 
iae,  Ubrlp  is  a  225  kDa  protein  that  binds  to  potential 
N-end  rule  substrates  through  their  primary  destabilizing 
N-terminal  residues  Phe,  Leu,  Trp,  Tyr,  lie,  Arg,  Lys  and 
His  (Bartel  et  al.,  1990).  Ubrlp  contains  at  least  three 
substrate-binding  sites.  The  type  1  site  is  specific  for  basic 
N-terminal  residues  Arg,  Lys  and  His.  The  type  2  site  is 
specific  for  the  hydrophobic  residues  Phe,  Leu,  Trp,  Tyr 
and  lie  (Reiss  et  al,  1988;  Gonda  et  al. ,  1989;  Baker  and 
Varshavsky,  1991).  Ubrlp  also  targets  short-lived  proteins 
such  as  Cup9p  and  Gpalp,  which  lack  destabilizing 
N-terminal  residues,  and  are  recognized  by  Ubrlp  through 
its  third  substrate-binding  site,  the  exact  location  and 
specificity  of  which  remain  to  be  determined  (Madura  and 
Varshavsky,  1994;  Byrd  et  al,  1998).  Similar  but  distinct 
versions  of  the  N-end  rule  pathway  are  present  in  all 
organisms  examined,  from  bacteria  to  mammals.  For  a 
summary  of  the  currently  known  functions  of  this  pathway, 
see  Kwon  et  al.  (1999a,b). 

The  roles  of  E3  proteins  in  the  E2,  E3-dependent 
ubiquitylation  of  substrates  are  not  understood  in  detail. 
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It,  has  been  shown  that  the  UBR 1 -encoded  N-recognin, 
the  E3  of  the  N-end  rule  pathway,  functions  in  a  complex 
with  Ubc2p  (Rad6p)  (Madura  et  al. ,  1993;  Watkins  et  al. , 
1993),  one  of  13  E2  enzymes  in  S.cerevisiae  (Hochstrasser, 
1996).  The  polyacidic  C-terminal  tail  of  Ubc2p  was  found 
to  be  essential  for  the  binding  of  Ubc2p  to  Ubrlp,  but  the 
Ubc2p-binding  site  of  Ubrlp  remained  unknown.  The 
mechanism  of  cooperation  between  E3  (Ubrlp)  and  E2 
(Ubc2p)  in  forming  a  substrate-linked  multi-Ub  chain  is 
unknown  for  either  this  or  other  Ub-dependent  pathways. 

We  report  two  main  results,  (i)  The  Ubrlp-Ubc2p 
interaction  is  mediated  by  the  basic  residue-rich  (BRR) 
region  of  Ubrlp  and  the  polyacidic  tail  of  Ubc2p.  Interes¬ 
tingly,  this  high-affinity  interaction  is  not  essential  for  the 
activity  of  the  N-end  rule  pathway,  (ii)  The  RING-H2 
domain  of  Ubrlp,  adjacent  to  the  BRR  region,  is  strictly 
required  for  ubiquitylation  of  N-end  rule  substrates,  even 
though  this  domain  is  not  required  for  the  binding  of 
Ubrlp  to  either  Ubc2p  or  N-end  rule  substrates.  Because 
the  RING-H2  and  similar  RING  variants  are  a  character¬ 
istic  feature  of  many  otherwise  dissimilar  E3  proteins,  the 
ubiquitylation-enabling  function  of  the  Ubrlp  RING-H2 
identified  in  the  present  work  is  likely  to  be  a  general 
property  of  E3  components  in  the  Ub  system. 

Terminology 

Ubiquitin  whose  C-terminal  (Gly76)  carboxyl  group  is 
covalently  linked  to  another  compound  is  called  the 
ubiquity l  moiety,  the  derivative  terms  being  ubiquitylation 
and  ubiquitylated.  The  term  Ub  refers  to  both  free  ubiquitin 
and  the  ubiquityl  moiety.  This  nomenclature,  which  is 
also  recommended  by  the  Nomenclature  Committee  of 
the  International  Union  of  Biochemistry  and  Molecular 
Biology,  brings  Ub-related  terms  in  line  with  the  standard 
chemical  terminology  (Varshavsky,  1997). 

Results 

Dissection  of  the  Ubr1p-Ubc2p  interaction 

To  search  for  Ubc2p-interacting  proteins,  we  employed 
the  two-hybrid  technique  (Fields  and  Song,  1989;  James 
et  ai .,  1996),  using  the  20  kDa  Ubc2p  as  bait.  A  plasmid 
thus  identified  contained  a  fragment  of  the  S.cerevisiae 
open  reading  frame  (ORF)  YLR024C,  which  was  previ¬ 
ously  termed  UBR2  (Hochstrasser,  1996;  Kwon  et  al ., 
1998),  because  the  predicted  sequence  of  the  216  kDa 
Ubr2p  was  22%  identical  and  46%  similar  to  that  of  the 
225  kDa,  UBR1 -encoded  N-recognin,  the  E3  of  the  N-end 
rule  pathway  (Bartel  et  ai ,  1990).  Ubr2p  is  not  essential 
for  cell  viability  under  standard  growth  conditions  and 
does  not  target  N-end  rule  substrates,  but  does  compete 
with  Ubrlp  for  the  binding  to  Ubc2p  (H.Rao,  Y.Xie  and 
A. Varshavsky,  unpublished  data). 

The  cloned  Ubc2p-interacting  region  of  Ubr2p  was  a 
287  residue  fragment  (positions  1134-1420).  Sequence 
comparisons  revealed  two  distinct  motifs  conserved 
between  Ubrlp  and  Ubr2p  in  this  region  (Figure  1A).  One 
was  a  basic  residue-rich  region,  termed  BRR,  and  the 
other  a  RING-H2  finger  (Saurin  et  al. ,  1996;  Kwon  et  al ., 
1998).  The  RING-H2  motif  is  a  distinct  variant  of  the 
previously  defined  Cys/His-rich  RING  motif,  which  is 
thought  to  be  involved  in  protein-protein  interactions 
(Borden  and  Freemont,  1996;  Inouye  et  al. ,  1997).  Two- 


hybrid  analyses  indicated  that  Ubc2p  interacted  with  the 
Ubr2p-homologous  region  of  Ubrlp  as  well  (positions 
1081-1367)  (Figure  1C  and  D).  In  addition,  two-hybrid 
assays  with  a  series  of  Ubrlp  fragments  fused  to  the 
Gal4p  transcriptional  activation  domain  demonstrated  that 
the  287  residue  region  was  the  only  detectable  Ubc2p- 
binding  site  in  Ubrlp  (data  not  shown).  We  used  site- 
directed  mutagenesis  to  dissect  the  Ubc2p-Ubrlp  inter¬ 
action  in  this  region  of  Ubrlp.  Two  sets  of  mutants  were 
constructed  (Figure  IB).  The  mutants  of  one  set,  termed 
MB  (mutation  in  BRR),  bore  a  missense  mutation(s)  in 
the  BRR  domain;  the  mutants  of  another  set,  termed  MR 
(mutation  in  RING-H2)  bore  either  Cys— >Ser  or  His-^Ala 
mutations  in  the  RING-H2  finger.  These  derivatives  of 
the  287  residue  region  of  Ubrlp  were  tested  for  their 
binding  to  Ubc2p  using  a  version  of  the  two-hybrid  assay 
that  utilized  two  reporter  genes,  ADE2  and  HIS3  (James 
et  al. ,  1996).  This  design  increased  the  assay’s  affinity 
range,  because  a  strong  two-hybrid  interaction  was 
required  to  confer  on  the  tester  cells  the  Ade+  phenotype, 
whereas  a  weak  two-hybrid  interaction  sufficed  to  make 
the  same  tester  cells  His+  (James  et  al .,  1996). 

The  replacement  of  a  single  basic  residue  at  various 
positions  in  the  BRR  domain  of  Ubrlp  with  uncharged 
residues  (e.g.  the  alleles  MB  10,  MB  11  and  MB  17)  sig¬ 
nificantly  weakened  the  interaction  of  Ubc2p  with  the 
287  residue  fragment  of  Ubrlp,  as  indicated  by  the  Ade~ 
His+  phenotype  of  the  tester  strain,  in  comparison  with 
the  Ade+  His+  phenotype  observed  with  the  fragment’s 
wild-type  (wt)  version  (Figure  1C).  When  two  basic 
residues  of  the  BRR  domain  were  converted  to  uncharged 
residues  (MB4  allele),  the  resulting  tester  strain  was  still 
His+,  but  grew  more  slowly  on  histidine-lacking  medium 
than  did  the  tester  strain  bearing  single-substitution  Ubrlp 
mutants  (Figure  1C;  data  not  shown),  suggesting  a  further 
weakening  of  the  Ubc2p-Ubrlp  interactions.  With  triple¬ 
substitution  mutants  in  the  BRR  domain  of  Ubrlp  (MB5 
and  MB6),  the  affinity  of  Ubc2p  for  the  287  residue  region 
of  Ubrlp  decreased  to  levels  undetectable  with  the  two- 
hybrid  assay  (Figure  1C). 

In  contrast  to  these  findings  with  the  BRR  domain, 
none  of  the  four  single  Cys— >Ser  substitutions  in  the 
adjacent  RING-H2  finger  of  Ubrlp  impaired  the  interaction 
of  Ubc2p  with  the  287  residue  region  of  Ubrlp  (Figure  1C, 
mutants  MR1-MR3  and  MR18).  A  His— >Ala  mutation  in 
the  RING-H2  finger  was  also  without  effect  (Figure  1C, 
mutant  MR  12).  Moreover,  even  the  simultaneous 
Cys^Ser  alterations  at  two  positions  of  the  RING-H2 
finger  failed  to  affect  the  Ade+  His+  phenotype  of  the 
tester  strain  (Figure  1C,  mutants  MR7  and  MR8).  The 
287  residue  region  of  Ubrlp  was  then  subcloned  into  two 
fusions,  which  contained,  respectively,  the  BRR  domain 
alone  (positions  1081-1220)  and  the  RING-H2  finger 
alone  (positions  1177-1367).  The  RING-H2  finger  alone 
did  not  exhibit  affinity  to  Ubc2p  (Figure  ID).  In  contrast, 
the  binding  of  Ubc2p  to  the  BRR  domain  alone  was 
readily  detectable,  but  this  interaction  was  weakened  in 
comparison  with  the  interaction  between  Ubc2p  and  the 
287  residue  region  of  Ubrlp  containing  both  BRR  and 
RING-H2  (Figure  ID). 

Taken  together,  these  results  indicated  that  the  positively 
charged  BRR  domain  is  the  main  region  mediating  the 
binding  of  Ubrlp  to  Ubc2p,  through  largely  electrostatic 
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Fig.  1.  Ubc2p  binds  to  the  BRR  domain  of  Ubrlp.  (A)  Sequence  comparison  of  the  Ubc2p-binding  domains  of  S.cerevisiae  Ubrlp  and  Ubr2p. 
Identical  residues  are  highlighted  in  black;  similar  residues  are  shaded.  The  conserved  and  similar  basic  residues  are  marked  by  ‘  +  \  The  Cys  and 
His  residues  of  the  RING-H2  motif  are  denoted  by  asterisks.  (B)  Schematic  representation  of  mutations  in  the  BRR  and  RING-H2  domains  of 
Ubrlp.  The  287  residue  region  of  the  1950  residue  Ubrlp  (positions  1081-1367)  was  divided  into  two  regions  containing  the  BRR  domain  and  the 
RING-H2  finger.  Mutations  of  basic  residues,  termed  the  MB  mutations,  in  the  BRR  domain  were  at  positions  1165-1175  and  are  underlined  in  bold. 
Mutations  of  either  Cys  or  His  residues  in  the  RING-H2  finger,  termed  the  MR  mutations,  were  at  positions  1220-1323  and  are  underlined  in  bold. 
The  positions  of  mutated  Cys  and  His  residues  within  RING-H2  are  listed  in  parentheses.  (C)  Summary  of  the  two-reporter  ( HIS3  and  ADE2) 
two-hybrid  results  with  the  MB  and  MR  mutants  of  Ubrlp  versus  wt  Ubc2p.  See  the  text  and  Materials  and  methods  for  details.  (D)  wt  Ubc2p  and 
its  two  truncated  alleles,  Ubc2p]_149  and  Ubc2pia_i72,  were  assayed  in  the  two-reporter/two-hybrid  system  for  their  interaction  with  either  the 
287  residue  Ubrlp  fragment,  or  its  BRR  domain  alone,  or  its  RING-H2  domain  alone.  The  9  residue  N-terminal  region  and  the  polyacidic 
C-terminal  tail  of  Ubc2p  are  denoted  by  the  black  and  hatched  boxes,  respectively. 


interactions  of  the  BRR  region  with  the  polyacidic 
C-terminal  tail  of  Ubc2p  (see  below).  These  findings  also 
indicated  that  the  presence  of  the  RING-H2  finger  near 
the  BRR  domain  was  not  essential  for  the  binding  of 
Ubc2p  but  was  required  (in  an  undefined  way;  but  see 
below)  for  the  high-affinity  Ubrlp-Ubc2p  interaction 
(Figure  1B-D).  In  addition,  either  single  or  double 
Cys— >Ser  substitutions  in  the  RING-H2  finger  did  not 
appear  to  result  in  a  gross  conformational  alteration  of 
the  BRR/RING-H2  region  of  Ubrlp,  as  indicated  by  the 
undiminished  affinity  of  mutants  in  this  region  for  Ubc2p 
(Figure  1C);  further  evidence  for  this  is  described  below. 

Previous  work  has  shown  that  both  the  N-terminal 
9  residue  region  and  the  23  residue  polyacidic  C-terminal 
tail  of  Ubc2p  are  required  for  the  binding  of  Ubc2p  to 
Ubrlp  (Madura  et  aL,  1993;  Watkins  et  al. ,  1993).  To 
determine  the  relative  contributions  of  these  Ubc2p  regions 
to  the  interaction  between  Ubc2p  and  the  BRR  domain  of 


Ubrlp,  we  constructed  two  hybrid  fusions  containing  two 
truncated  alleles  of  UBC2  which  encoded,  respectively, 
Ubc2p10_i72  (lacking  residues  1-9)  and  Ubc2p!_149  (lacking 
the  polyacidic  tail).  These  derivatives  of  Ubc2p  were 
tested  in  the  two-hybrid  assay  for  binding  to  the  entire 
287  residue  region  of  Ubrlp,  or  to  the  BRR  domain  alone, 
or  to  the  RING-H2  finger  alone.  The  deletion  of  the 
polyacidic  C-terminal  tail  of  Ubc2p  abolished  its  inter¬ 
action  with  the  BRR  domain  of  Ubrlp,  whereas  the 
deletion  of  the  first  9  residues  of  Ubc2p  weakened  but 
did  not  abolish  this  interaction  (Figure  ID).  Interestingly, 
the  latter  interaction,  weakened  but  readily  detectable,  was 
independent  of  the  presence  of  the  RING-H2  finger 
(Figure  ID).  Thus,  the  bulk  of  the  affinity  of  Ubc2p  for 
the  287  residue  fragment  of  Ubrlp  stems  from  electrostatic 
interactions  between  the  polyacidic  C-terminal  tail  of 
Ubc2p  and  the  positively  charged,  -140  residue  BRR 
domain  of  the  1950  residue  Ubrlp.  (The  size  of  a 
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Fig.  2.  The  BRR  domain  of  Ubrlp  is  essential  for  high-affinity 
binding  to  Ubc2p.  (A)  GST  pull-down  assays  with  wt  Ubrlp  and  its 
mutants.  FLAG-Ubrlp  (denoted  as  wt  UBR1)  (lane  2),  or  FLAG- 
Ubr1pMRI  (lane  3)  or  FLAG-UbrlpMR2  (lane  4),  or  FLAG-UbrlpMB6 
(lane  5)  were  overexpressed  in  AVY301  {ubrl A  w/?c2A)  cells  (see 
Materials  and  methods).  Cells  were  labeled  with  [35S]methionine  for 
30  min  at  30°C,  and  the  extracts  were  assayed  using  glutathione- 
agarose  beads  bound  with  GST-Ubc2p  (top  panel)  or  applied  to 
immunoprecipitation  with  anti-FLAG  monoclonal  antibody  (lower 
panel).  (B)  In  vivo  association  of  Ubc2p  with  wt  Ubrlp,  UbrlpMR1 
and  UbrlpMR2,  but  not  with  UbrlpMB6.  Cell  extracts  from 
transformants  expressing  FLAG-Ubrlp  alone  (lane  1)  or  the 
N-terminally  HA-tagged  Ubc2p  (HA-Ubc2p)  alone  (lane  2),  or 
co-expressing  HA-Ubc2p  with  either  FLAG-Ubrlp  (lane  3)  or 
FLAG-UbrlpMB6  (lane  4),  FLAG-UbrlpMR1  (lane  5)  or  FLAG- 
UbrlpMR2  (lane  6)  were  incubated  with  anti-HA  antibody  (see 
Materials  and  methods).  Co-precipitated  wt  Ubrlp  and  its  mutant 
derivatives  were  resolved  by  SDS-PAGE,  followed  by  immunoblotting 
with  anti-FLAG  antibody  (top  panel).  The  asterisk  marks  a  cross¬ 
reactive  band.  The  expression  levels  of  Ubrlp  and  Ubc2p  in  these 
transformants  were  examined  by  immunoblotting  with,  respectively, 
anti-FLAG  (middle  panel)  and  anti-HA  antibodies  (bottom  panel). 
SDS-PAGE  was  carried  out  in  8%  gels. 


contiguous  region  within  the  BRR  domain  that  is  required 
for  the  binding  of  Ubc2p  is  likely  to  be  significantly  less 
than  140  residues.) 

To  determine  whether  basic  residues  of  the  BRR  domain 
and  the  RING-H2  finger  were  essential  for  the  binding  of 
the  full-length  Ubrlp  to  Ubc2p,  we  first  employed  a 
pull-down  assay  with  a  fusion  of  Ubc2p  to  glutathione 
5-transferase  (GST).  Specifically,  wt  FLAG-Ubrlp  (tagged 
at  the  N-terminus  with  the  FLAG  epitope),  or  FLAG- 
Ubrlp1^66  (a  triple-substitution  mutant  in  the  BRR  domain; 
Figure  IB),  or  FLAG-UbrlpMR1  or  FLAG-UbrlpMR2  (a 
single  Cys— >Ser  substitution  at  Cysl  and  Cys5,  respect¬ 
ively;  Figure  IB)  was  overexpressed  in  a  ubrl  A  ubc2A 
S.cerevisiae  strain  and  labeled  with  [35S]methionine.  The 
N-terminal  FLAG  epitope  did  not  impair  the  function  of 
Ubrlp  (A.Webster  and  A.Varshavsky,  unpublished  data). 
The  extracts  from  labeled,  Ubrlp-overexpressing  cells 
(and  the  extract  from  cells  carrying  vector  alone)  were 
incubated  with  glutathione-agarose  beads  loaded  with 
GST-Ubc2p,  and  the  amounts  of  35S-labeled  FLAG- 
Ubrlp  and  Ubrlp  mutants  recovered  from  the  beads  were 
determined  by  SDS-PAGE  (Figure  2A,  top  panel).  In 
parallel,  the  same  extracts  were  immunoprecipitated  with 
a  monoclonal  anti-FLAG  antibody,  to  verify  equality  of 
the  initial  inputs  of  FLAG-Ubrlp  and  its  mutants 
(Figure  2A,  lower  panel).  In  agreement  with  the  results 
of  the  two-hybrid  assays  (Figure  IB  and  C),  the  binding 


of  FLAG-UbrlpMB6  to  Ubc2p  was  barely  detectable 
(Figure  2A,  top  panel,  lane  5),  whereas  the  binding  of 
either  FLAG-UbrlpMRI  or  FLAG-UbrlpMR2  to  Ubc2p  was 
comparable  to  that  of  wt  FLAG-Ubrlp  (Figure  2 A,  top 
panel,  lanes  2-4). 

We  also  examined  the  in  vivo  association  of  wt  Ubrlp 
and  its  mutants  with  Ubc2p,  using  co-immunoprecipi- 
tation-immunoblot  assays.  Either  wt  FLAG-Ubrlp, 
FLAG-UbrlpMB6,  FLAG-UbrlpMR1  or  FLAG-UbrlpMR2 
was  co-overexpressed  in  a  ubrl  A  ubclA  S.cerevisiae 
with  Ubc2p  bearing  the  N-terminal  hemagglutinin  (HA) 
epitope.  Controls  included  congenic  cells  expressing 
FLAG-Ubrlp  alone  or  Ubc2p  alone.  Proteins  were 
immunoprecipitated  from  cell  extracts  with  anti-HA  anti¬ 
body,  followed  by  SDS-PAGE  and  immunoblotting  with 
anti-FLAG  antibody  (Figure  2B,  top  panel).  The  expres¬ 
sion  levels  of  Ubrlp  and  Ubc2p  in  these  transformants 
were  monitored  by  immunoblotting  the  initial  extracts 
with  either  anti-FLAG  antibody  (Figure  2B,  middle  panel) 
or  anti-HA  antibody  (Figure  2B,  bottom  panel).  In  agree¬ 
ment  with  the  results  of  two-hybrid  and  pull-down  assays, 
the  in  vivo  association  of  FLAG-UbrlpMB6  with  Ubc2p 
was  undetectable  (lane  4),  whereas  both  FLAG-UbrlpMRl 
and  FLAG-UbrlpMR2  bound  Ubc2p  as  efficiently  as  wt 
FLAG-Ubrlp  (lanes  3,  5  and  6).  Thus,  the  BRR  domain 
of  Ubrlp  is  essential  for  the  binding  of  Ubrlp  to  Ubc2p, 
whereas  the  adjacent  RING-H2  finger  of  Ubrlp  is  not 
directly  involved  (Figures  IB,  C  and  2). 

High  affinity  between  Ubrlp  and  Ubc2p  is  not 
essential  for  degradation  of  N-end  rule  substrates 

We  asked  whether  the  MB-type  point  mutations  in  the 
BRR  domain  of  Ubrlp  that  virtually  abolished  the  binding 
of  Ubrlp  to  wt  Ubc2p  (Figures  IB,  C  and  2)  affected  the 
activity  of  the  N-end  rule  pathway,  wt  Ubrlp  and  the 
BRR-domain  mutants  UbrlpMB4  and  UbrlpMB6  (Figure  1C 
and  D)  were  expressed  in  ubrl  A  cells  from  a  low-copy 
plasmid  and  the  native  Pubri  promoter.  The  ubrl  A  cells 
also  expressed  an  N-end  rule  substrate,  either  Arg-fJgal  or 
Leu-|3gal,  which  are  short-lived  in  UBR1  cells  (/1/2  of  ~2 
and  -3  min,  respectively)  (Varshavsky,  1996).  Arg-Pgal 
and  Leu-pgal  were  expressed  as  Ub-Arg-pgal  and  Ub- 
Leu-pgal,  which  were  co-translationally  deubiquitylated 
in  vivo  (Varshavsky,  1996).  Previous  work  has  also  shown 
that  the  enzymatic  activity  of  an  N-end  rule  substrate 
such  as  X-Pgal  is  a  sensitive  measure  of  the  substrate’s 
metabolic  stability  in  vivo  (Madura  et  al. ,  1993;  Kwon 
et  al. ,  1999a). 

Both  Arg-pgal  and  Leu-pgal  were  long-lived  in  vivo  in 
the  absence  of  Ubrlp  (?1/2  >20  h)  (‘vector’  bars  in 
Figure  3A;  data  not  shown).  As  expected,  Arg-pgal  and 
Leu-Pgal  were  degraded  in  the  presence  of  wt  Ubrlp 
(Figure  3A).  Surprisingly,  the  steady-state  levels  of  Arg- 
Pgal  and  Leu-Pgal  in  the  transformants  with  UbrlpMB4  or 
UbrlpMB6  were  similar  to  (or  only  slightly  higher  than) 
the  levels  of  these  proteins  in  transformants  expressing 
wt  Ubrlp  (Figure  3A),  indicating  that  high-affinity  binding 
of  Ubrlp  to  Ubc2p  is  not  essential  for  the  activity  of  the 
N-end  rule  pathway.  Control  experiments  with  ubc2A  cells 
confirmed  that  the  UbrlpMB6-dependent  degradation  of 
N-end  rule  substrates  was  Ubc2p-dependent  as  well  (data 
not  shown). 

Previous  work  with  the  tailless  Ubc2p!_149  mutant, 
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Fig.  3.  High-affinity  interaction  between  Ubrlp  and  Ubc2p  is  not  essential  for  the  activity  of  the  N-end  rule  pathway.  (A)  The  activity  of  Ubrlp  is 
weakly  affected  by  MB  mutations.  All  UBR1  alleles  were  expressed  from  the  PUBri  promoter  in  a  low-copy  vector.  Constructs  expressing  wt  Ubrlp, 
UbrlpMB4  or  UbrlpMB6,  or  the  empty  vector  were  transformed  into  JD55  (ubrl A)  cells  expressing  Arg-pgal  or  Leu-pgal.  Measurements  of  j3gal 
activity  were  used  to  assay  the  degradation  of  Arg-pgal  and  Leu-Pgal.  The  values  shown  are  the  means  of  duplicate  measurements  of  three 
independent  transformants.  Standard  deviations  are  indicated  above  the  bars.  (B  and  C)  High-affinity  association  of  Ubrlp  and  Ubc2p  is  not  essential 
for  activity  of  the  N-end  rule  pathway,  pgal  activity  assays  (B)  and  immunoprecipitations  with  anti-pgal  antibody  (C)  were  used  to  compare  the 
degradation  of  model  substrates,  either  Arg-pgal  (B  and  C)  or  Tyr-Pgal  (B).  All  UBR1  and  UBC2  alleles  were  expressed  from  the  native  PUBR1  and 
PUBC2  promoters,  respectively,  in  a  low-copy  vector.  Different  combinations  of  UBR1  and  UBC2  alleles  were  transformed  into  ubrl  A  ubc2A  cells 
expressing  either  Arg-Pgal,  Tyr-pgal  (B)  or  DHFRha-UbR48-Arg-Pgal  (C)  (see  the  text  and  Materials  and  methods).  The  measured  ratio  of  Arg-pgal 
to  DHFR-Ub  (the  reference  protein)  in  the  absence  of  Ubrlp  [(C),  lane  1]  was  set  at  1.00  and  used  to  normalize  the  other  ratios. 


which  did  not  bind  to  wt  Ubrlp,  led  to  the  conclusion 
that  Ubc2p1_149  did  not  support  the  degradation  of  N-end 
rule  substrates  (Madura  et  al. ,  1993),  and  implied  that 
high-affinity  binding  of  Ubc2p  to  Ubrlp  was  strictly 
essential  for  the  pathway’s  function.  However,  another 
study  reported  a  diminished  but  significant  activity 
Ubc2p1_149  (Watkins  et  al. ,  1993).  The  latter  conclusion 
was  consistent  with  our  findings  about  the  BRR-region 
mutants  of  Ubrlp  (Figure  3A).  To  address  this  discrepancy, 
we  compared  the  activity  of  the  N-end  rule  pathway  in 
ubrl  A  ubc2A  cells  expressing  different  combinations  of 
either  wt  Ubrlp  or  UbrlpMB6  with  either  wt  Ubc2p  or  the 
tailless  Ubc2p  (Ubc2p1_149).  UBR1 ,  UBC2  and  their  mutant 
alleles  were  expressed  from  their  native  promoters  and  a 
low-copy  plasmid.  The  relative  rates  of  degradation  of 
Arg-pgal  and  Tyr-pgal  (^/2  of  ~2  and  ~10  min,  respect¬ 
ively)  (Bachmair  et  al .,  1986;  Varshavsky,  1996)  were 
assayed  by  measuring  the  activity  of  pgal  (Figure  3B). 
With  wt  Ubc2p,  the  activity  of  the  N-end  rule  pathway 
did  not  change  significantly  between  the  wt  Ubrlp  and 
UbrlpMB6  alleles  (Figure  3B).  Furthermore,  in  the  presence 
of  wt  Ubrlp,  the  pathway’s  activity  decreased  significantly 
but  not  strongly  upon  the  replacement  of  wt  Ubc2p  with 
its  tailless  derivative  Ubc2p1_149  (Figure  3B),  in  agreement 
with  the  finding  of  Watkins  et  al.  (1993)  but  in  contrast 
to  the  result  of  Madura  et  al.  (1993).  Interestingly,  the 
activity  of  the  N-end  rule  pathway  was  further  decreased, 
but  still  not  abolished,  by  combining  UbrlpMB6  and 
Ubc2p1_149  (Figure  3B). 

To  measure  the  relative  metabolic  stability  of  Arg-Pgal 
in  a  different  way,  we  utilized  a  recently  developed 
UPR  (Ub/protein/reference)  technique,  which  provides  a 
reference  protein  (Levy  et  al .,  1996,  1999).  This  method 
employs  a  linear  fusion  such  as  DHFRha-UbR48-X-Pgal, 
in  which  UbR48,  containing  Arg  instead  of  Lys  at  position 
48,  is  placed  between  a  protein  of  interest  (X-pgal)  and 
a  reference  protein  moiety  such  as  DHFRha  (mouse 


dihydrofolate  reductase  bearing  the  HA  epitope  tag)  (Levy 
et  al. ,  1996).  DHFRha-UbR48-X~Pgal  is  co-translationally 
cleaved  by  Ub-specific  deubiquitylating  enzymes  (DUBs) 
(Wilkinson  and  Hochstrasser,  1998)  after  the  last  residue 
of  UbR4S,  producing  equimolar  amounts  of  X-pgal  and 
DHFRha-UbR48.  This  way,  the  relative  amounts  of  X-pgal 
can  be  normalized  against  the  (co-immunoprecipitated) 
DHFRha-UbR48  reference  protein  in  the  same  sample.  The 
UPR  technique  can  therefore  compensate  for  the  scatter 
of  immunoprecipitation  yields,  sample  volumes,  and  other 
sources  of  sample-to- sample  variation  (Levy  et  al .,  1996). 
ubrl  A  ubc2A  cells  expressing  DHFRha-UbR48-Arg-pgal, 
and  wt  Ubrlp  or  UbrlpMB6  with  wt  Ubc2p  or  Ubc2p1_149 
were  labeled  with  [35S]methionine  for  30  min  at  30°C. 
Cell  extracts  were  immunoprecipitated  with  both  anti-Pgal 
and  anti-HA  antibodies,  followed  by  SDS-PAGE  and  the 
determination,  using  a  Phosphorlmager,  of  the  ratio  of 
Arg-pgal  (test  protein)  to  DHFRha-UbR48  (reference  pro¬ 
tein)  (Figure  3C).  The  ratio  determined  for  UBR1 -lacking 
cells  was  arbitrarily  set  at  1.00,  and  was  used  to  normalize 
the  same  ratios  from  other  transformants.  The  results 
(Figure  3C)  were  in  agreement  with  those  obtained  by 
measuring  the  enzymatic  activity  of  Pgal  (Figure  3A  and 
B).  Specifically,  in  the  presence  of  wt  Ubc2  and  UbrlpMB6, 
-90%  of  Arg-Pgal  was  degraded  under  the  assay’s  condi¬ 
tions,  similar  to  the  result  with  wt  Ubc2p  and  wt  Ubrlp. 
In  cells  expressing  wt  Ubrlp  and  Ubc2p1_149,  -70%  of 
Arg-Pgal  was  degraded.  With  UbrlpMB6  and  Ubc2p1„149 
the  degradation  of  Arg- Pgal  was  decreased  further,  to 
-50%,  but  still  not  abolished  (Figure  3C). 

Taken  together,  these  results  (Figure  3)  indicated  that 
high-affinity  interactions  between  the  E3  Ubrlp  and  the 
E2  Ubc2p  are  essential  for  normal  levels  of  the  pathway’s 
activity.  These  data  also  showed  that  even  much  weaker 
interactions  between  mutant  variants  of  Ubrlp  and  Ubc2p, 
the  interactions  that  could  not  be  detected  by  either  two- 
hybrid  or  pull-down  assays  (Figures  1  and  2),  were  still 
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sufficient  to  sustain  a  large  fraction  of  activity  of  the 
N-end  rule  pathway  (Figure  3).  We  conclude  that  a 
transient  interaction  between  Ubrlp  and  Ubc2p  is  function¬ 
ally  suboptima]  but  still  sufficient  for  the  multiubiquityl- 
ation  of  a  substrate  and  for  the  substrate’s  delivery  to  the 
26S  proteasome. 

The  RING-H2  finger  of  Ubrlp  is  essential  for 
degradation  of  N-end  rule  substrates 

Given  the  proximity  of  the  104  residue  RING-H2  finger 
to  the  BRR  region  of  Ubrlp  that  mediates  its  binding  to 
Ubc2p  (Figure  1),  we  determined  the  effects  of  missense 
mutations  in  RING-H2  (Figure  IB)  on  the  ability  of  Ubrlp 
to  target  N-end  rule  substrates  for  ubiquitylation  and 
degradation.  Since  the  structure  of  the  folded  RING-H2 
domain  is  unknown,  we  constructed  single  missense 
mutants  throughout  this  domain  (Figure  IB).  The 
sequences  RING-H2mr1,  RING-H2mris,  RING-H2mr3  and 
RING-H2MR2  contained  single  CyS“>Ser  substitutions  at 
the  Cysl,  Cys2,  Cys4  and  Cys5  positions  of  RING-H2, 
respectively  (Figure  1A  and  B).  RING-H2MR1 2  contained 
a  Hi s— > Ala  substitution  at  the  Hisl  position  of  RING-H2 
(Figure  1A  and  B).  These  RING-H2  variants  were  intro¬ 
duced  into  wt  Ubrl.  The  resulting  mutant  proteins, 
UbrlpMR1,  UbrlpMR2,  UbrlpMR3,  UbrlpMR12  and 
UbrlpMR18,  were  expressed  in  ubrl  A  cells  from  a  low- 
copy  plasmid,  the  native  Pubri  promoter,  and  in  the 
presence  of  either  Arg-(3gal  or  Leu-pgal,  as  described  for 
the  BRR-domain  mutants  of  Ubrlp  (Figure  3). 

In  striking  contrast  to  the  negligible  effect  of  mutations 
in  the  RING-H2  finger  of  Ubrlp  on  the  binding  of  Ubrlp 
to  Ubc2p  (Figures  1C,  D  and  2),  all  of  the  tested 
RING-H2  mutations  except  MR12  had  a  major  inhibitory 
effect  on  the  degradation  of  N-end  rule  substrates 
(Figure  4).  Specifically,  UbrlpMR1,  which  bore  a  Cys— >Ser 
substitution  at  Cysl  of  the  RING-H2  finger,  was  com¬ 
pletely  inactive  as  an  E3  (N-recognin),  in  that  the 
steady-state  levels  of  Arg-pgal  and  Leu-Pgal  in  cells 
expressing  UbrlpMRI  were  indistinguishable  from  the 
levels  of  these  substrates  in  ubrl  A  cells,  which  lacked  the 
N-end  rule  pathway  (Figure  4).  Other  mutations  of  the 
RING-H2  finger  impaired  the  N-end  rule  pathway  strongly 
but  incompletely,  whereas  the  MR  12  mutation  (Hi s— > Ala 
at  Hisl  of  RING-H2)  had  a  relatively  weak  effect 
(Figure  4). 

Previous  work  has  shown  that  Ubrlp  is  a  rate-limiting 
component  of  the  N-end  rule  pathway,  in  that  overexpres¬ 
sion  of  Ubrlp  accelerated  the  degradation  of  N-end 
rule  substrates  (Bartel  et  al. ,  1990).  We  introduced  the 
apparently  inactive  UbrlpMR1  and  the  partially  active 
UbrlpMR2  mutations  into  wt  Ubrlp  bearing  the  N-terminal 
FLAG  epitope  and  expressed  the  resulting  proteins  from 
the  PAdhi  promoter  on  a  high-copy  plasmid.  Overexpres¬ 
sion  of  the  partially  active  UbrlpMR2  increased  the  activity 
of  the  N-end  rule  pathway  in  comparison  with  cells 
expressing  UbrlpMR2  at  wt  levels  (Figure  4).  In  contrast, 
the  effect  of  overexpressing  UbrlpMR1  was  marginal,  in 
that  only  ~4%  of  Leu-Pgal  and  <20%  of  Arg-Pgal  were 
degraded,  in  comparison  with  virtually  no  degradation  at 
physiological  levels  of  UbrlpMR1  (Figure  4).  Finally,  the 
-90  kDa  in  vivo  cleavage  product  of  pgal  (denoted  by  an 
asterisk),  which  is  characteristic  of  short-lived  X-Pgal 
substrates  and  is  not  produced  in  the  absence  of  either 


Fig.  4.  Proteolysis  by  the  N-end  rule  pathway  requires  the  RING-H2 
finger  of  Ubrlp.  Methods  described  in  the  legend  to  Figure  3  and 
Materials  and  methods  were  used  to  assess  the  degradation  of  test 
substrates  Arg-pgal  (A  and  B)  and  Leu-pgal  (A)  in  the  presence  of 
different  UBRl  alleles  (see  Figure  IB).  All  UBR1  alleles  were 
expressed  from  the  native  Pubri  promoter  in  a  low-copy  vector.  The 
UbrlpMR!  (ADH1MR1)  and  UbrlpMR2  (ADH1MR2)  mutants  were 
also  expressed  from  the  Padhi  promoter  in  a  high-copy  vector. 
Constructs  expressing  wt  Ubrlp  and  its  mutants  were  transformed 
into  JD55  (ubrl A)  cells  expressing  the  Arg-pgal  or  Leu-pgal 
[(A),  pgal  activity  assays],  or  DHFRha-UbR4c -Arg-pgal  [(B), 
immunoprecipitation  assays].  The  activity  of  Pgal  in  the  absence  of 
VBR 1  (vector  alone)  was  set  at  100%.  The  values  shown  are  the 
means  of  duplicate  measurements  of  three  independent  transformants. 
Standard  deviations  are  indicated  above  the  bars.  In  (B),  the  asterisk 
denotes  the  90  kDa  pgal  cleavage  product  characteristic  of  the  short¬ 
lived  X-pgals  (Bachmair  et  al,  1986).  Ubrlp  co-immunoprecipitated 
with  anti-Pgal  antibody  is  also  indicated.  The  measured  ratio  of 
Arg-pgal  to  DHFR-Ub  in  the  absence  of  Ubrlp  [(B),  lane  vector]  was 
set  at  1.00  and  used  to  normalize  the  other  ratios. 


Ubrlp  or  Ubc2p  (Dohmen  et  al. ,  1991),  was  nearly  absent 
with  UbrlpMR1  (Figures  4B  and  5B);  this  result  was 
consistent  with  negligible  functional  activity  of  UbrlpMR1. 

Previous  work  has  shown  that  the  N-end  rule  pathway 
controls  the  import  of  peptides  through  degradation  of  the 
homeodomain  protein  Cup9p,  which  inhibits  the  expres¬ 
sion  of  the  Ptr2p  peptide  transporter  (Byrd  et  al. ,  1998). 
We  examined  whether  cells  expressing  UbrlpMR1  instead 
of  wt  Ubrlp  were  perturbed  in  their  ability  to  import 
peptides  by  comparing  the  growth  of  lys2A  ubrl  A  cells 


6837 


Y.Xie  and  A.Varshavsky 


A 


IP:  anti-pgal 
1  2  3  4  5 

Ubrlp 
L-pgal 


IP:  anti-Flag 


jf  £  jf 


Fig.  5.  The  RING-H2  finger  of  Ubrlp  is  required  for  ubiquitylation  of 
N-end  rule  substrates.  (A)  The  RING-H2  finger  is  not  required  for  the 
binding  of  N-end  rule  substrates  by  Ubrlp.  The  UPR-based  fusion 
DHFRha-UbR48-Leu-pgal  (see  the  text  and  Materials  and  methods) 
was  co-expressed  in  AVY301  ( ubrlA  ubc2A )  cells  with  the  FLAG- 
tagged  wt  Ubrlp  (lane  3),  UbrlpMRI  (lane  4),  UbrlpMR2  (lane  5)  or 
vector  alone  (lane  2).  The  Ubrlp  proteins  were  expressed  from  the 
PADhi  promoter  in  a  high-copy  vector.  Lane  1,  AVY301  cells 
expressing  wt  Ubrlp  alone.  Extracts  from  cells  labeled  with 
[35S] methionine  were  immunoprecipitated  with  anti-pgal  (top  panel)  or 
anti-FLAG  antibodies  (lower  panel),  followed  by  SDS-8%  PAGE  (see 
Materials  and  methods).  (B)  The  RING-H2  finger  of  Ubrlp  is  required 
for  ubiquitylation  of  N-end  rule  substrates.  JD55  (ubrlA)  cells 
expressing  Leu-pgal  and  different  amounts  of  Ubrlp  or  its  mutants 
were  labeled  with  [35S] methionine  for  6  min  at  30°C,  followed  by  the 
addition  of  NEM  to  50  mM  and  preparation  of  extracts  by  heating  at 
100°C  in  the  presence  of  SDS  (see  Materials  and  methods).  The 
extracts  were  precipitated  with  anti-pgal  antibody  and  fractionated  by 
SDS-6%  PAGE.  Note  the  virtual  absence  of  the  multiubiquitylated 
Leu-pgal  species  (denoted  Ubn-pgal  on  the  right)  from  cells  expressing 
UbrlpMRI  (lane  3).  The  asterisk  denotes  the  90  kDa  pgal  cleavage 
product  characteristic  of  short-lived  X-pgals  (Bachmair  et  ai,  1986). 


expressing  either  wt  Ubrlp,  UbrlpMR1  or  UbrlpMB6  on 
plates  containing  Lys-Ala  dipeptide  as  the  sole  source  of 
lysine.  Under  these  conditions,  inactivation  of  the  N-end 
rule  pathway  is  lethal,  owing  to  the  inability  of  cells 
(auxotrophic  for  lysine)  to  import  the  Lys-Ala  dipeptide 
(Byrd  et  al .,  1998).  Cells  expressing  UbrlpMR1,  which  is 
virtually  inactive  as  N-recognin,  failed  to  grow  on  Lys- 
Ala  dipeptide  plates,  whereas  cells  expressing  either  wt 
Ubrlp  or  UbrlpMB6  survived  (data  not  shown),  consistent 
with  the  effects  of  the  MR1  and  MB  6  mutations  on  the 
activity  of  the  N-end  rule  pathway  (Figures  3  and  4). 

We  conclude  that  at  least  Cysl  of  the  RING-H2  finger 
in  Ubrlp  is  essential  for  the  E3  function  of  Ubrlp  in  the 


N-end  rule  pathway,  even  though  this  cysteine  is  not 
required  for  the  binding  of  Ubrlp  to  Ubc2p.  Yet  another 
insight  from  the  pulse-immunoprecipitation  assays  with 
Arg-pgal  was  that  the  anti-Pgal  antibody  co-precipitated 
the  partially  active  FLAG-UbrlpMR2  and  the  inactive 
FLAG-UbrlpMR1  from  the  extracts  of  cells  that  over¬ 
expressed  these  Ubrlp  mutants  (Figure  4B).  Thus,  the 
functionally  inactivating  Cysl^Serl  mutation  in  the 
RING-H2  finger  of  UbrlpMR1  may  not  have  impaired 
the  binding  of  substrates  by  UbrlpMR1.  This  conjecture  is 
confirmed  below. 

RING-H2  of  Ubrlp  is  not  required  for  the  binding 
of  N-end  rule  substrates  by  Ubrlp 

To  compare  directly  the  affinities  of  Ubrlp  and  its  deriva¬ 
tives  for  an  N-end  rule  substrate,  FLAG-Ubrlp,  FLAG- 
UbrlpMR1  or  FLAG-UbrlpMR2  were  overexpressed  in  a 
ubrlA  ubc2A  strain,  which  also  expressed  Leu-pgal  (pro¬ 
duced  from  Ub-Leu-Pgal).  Owing  to  the  absence  of 
Ubc2p,  Leu-Pgal  was  long-lived  in  this  strain,  irrespective 
of  the  presence  of  Ubrlp  (Dohmen  et  ah,  1991).  Controls 
included  congenic  cells  expressing  FLAG-Ubrlp  alone  or 
Leu-pgal  alone.  Cells  were  labeled  with  [35S]methionine 
for  30  min;  proteins  were  immunoprecipitated  from 
extracts  with  anti-Pgal  antibody,  followed  by  SDS-PAGE. 
Anti-Pgal  co-immunoprecipitated  Leu-pgal  and  compar¬ 
able  amounts  of  either  wt  FLAG-Ubrlp,  FLAG-UbrlpMR1 
or  FLAG-UbrlpMR2  (Figure  5A,  upper  panel,  lanes  3-5). 
This  co-immunoprecipitation  was  specific  for  Leu-pgal, 
because  no  FLAG-Ubrlp  was  brought  down  with  anti- 
Pgal  in  the  absence  of  Leu-pgal  (Figure  5A,  upper  panel, 
lane  1).  The  expression  levels  of  FLAG-Ubrlp,  FLAG- 
Ubrlp14111  and  FLAG-UbrlpMR2  were  nearly  identical,  as 
could  be  shown  by  immunoprecipitating  these  proteins 
with  anti-FLAG  antibody  (Figure  5A,  lower  panel).  In 
contrast  to  the  results  with  anti-pgal  antibody  (Figure  5A, 
upper  panel),  Leu- Pgal  was  not  co-precipitated  with  the 
FLAG- tagged  Ubrlp  proteins  by  anti-FLAG  antibody 
(Figure  5  A,  lower  panel).  The  probable  explanation  is  that 
under  these  conditions,  only  free  (overproduced)  FLAG- 
Ubrlp  proteins  could  be  immunoprecipitated  by  anti- 
FLAG,  because  in  the  Leu-pgal/FLAG-Ubrlp  complex 
Leu- Pgal  is  bound  to  the  type  2  substrate-binding  site 
of  Ubrlp,  located  in  the  N-terminal  region  of  Ubrlp 
(A.Webster,  M.Ghislain  and  A.Varshavsky,  unpublished 
data).  As  a  result,  the  N-terminus  of  Ubrlp  could  be 
shielded  from  recognition  by  the  bulky  Pgal  moiety  bound 
to  Ubrlp. 

Taken  together,  these  results  (Figures  1-5)  indicated 
that  the  MR1  (Cysl—>Serl)  mutation  in  the  RING-H2 
finger  of  Ubrlp  virtually  eliminated  the  E3  function  of 
Ubrlp,  but  did  not  impair  its  affinity  for  either  N-end  rule 
substrates  or  the  Ubc2p  Ub-conjugating  enzyme. 

The  RING-H2  finger  of  Ubrlp  is  required  for 
ubiquitylation  of  N-end  rule  substrates 

To  examine  ubiquitylation  of  an  N-end  rule  substrate  in 
cells  containing  mutant  Ubrlp  proteins,  FLAG-UbrlpMR1, 
FLAG-UbrlpMR2,  FLAG-UbrlpMR12,  FLAG-UbrlpMB6 
(Figure  IB)  and  the  wt  FLAG-Ubrlp  were  expressed  in 
a  ubrlA  UBC2  strain,  which  also  expressed  Leu-Pgal 
(produced  from  Ub-Leu-Pgal).  To  minimize  the  extent  of 
the  degradation  of  Leu-pgal  during  labeling,  the  duration 


6838 


RING-H2  finger  and  E3  function 


of  the  [35S]methionine  pulse  was  reduced  to  6  min.  At  the 
end  of  the  pulse,  high  concentrations  of  V-ethylmaleimide 
(NEM),  the  inhibitor  of  DUBs  (Johnsson  and  Varshavsky, 
1994),  were  employed  to  inhibit  deubiquitylation  after 
cell  lysis.  35S-labeled,  immunoprecipitated  Leu-pgal  was 
fractionated  by  SDS-PAGE  (Figure  5B). 

Multiubiquitylated  derivatives  of  Leu-pgal  were 
observed  in  the  presence  but  not  in  the  absence  of  wt 
Ubrlp,  and  the  extent  of  ubiquitylation  increased  upon 
overexpression  of  FLAG-Ubrlp  (Figure  5B,  lanes  1,  2 
and  9).  However,  virtually  no  ubiquitylated  Leu-Pgal  was 
observed  with  UbrlpMR1,  the  degradation-inactive 
RING-H2  mutant  of  Ubrlp  (Figure  5B,  lane  3,  compare 
with  lanes  1  and  2).  Cells  overexpressing  FLAG-UbrlpMR1 
contained  trace  amounts  of  multiubiquitylated  Leu-Pgal, 
in  comparison  with  much  higher  levels  of  these  species 
in  cells  overexpressing  wt  FLAG-Ubrlp  (Figure  5B, 
lane  7,  compare  with  lane  9).  In  agreement  with  the  results 
of  degradation  tests  (Figure  4),  the  partially  degradation- 
defective  RING-H2  mutant  UbrlpMR2  yielded  intermediate 
levels  of  the  multiubiquitylated  Leu-Pgal,  and  ubiquityl¬ 
ation  was  strongly  enhanced  upon  overexpression  of  FLAG- 
Ubrlp1^2  (Figure  5B,  lanes  4  and  8).  UbrlpMR12,  whose 
ability  to  mediate  the  degradation  of  N-end  rule  substrate 
was  similar  to  that  of  wt  Ubrlp  (Figure  4),  yielded  the 
ubiquitylation  pattern  indistinguishable  from  that  produced 
by  wt  Ubrlp  (Figure  5B,  lane  6).  Consistent  with  the  results 
of  pgal  activity  assays  (Figure  3),  UbrlpMB6,  which  bore 
three  mutations  in  the  BRR  domain  and  was  severely 
impaired  in  its  binding  to  Ubc2p  (Figures  1-3),  yielded 
nearly  wt  levels  of  multiubiquitylated  Leu-Pgal  (Figure  5B, 
lane  5,  compare  with  lane  2).  Taken  together,  these  results 
indicated  that  the  RING-H2  finger  of  Ubrlp  is  specifically 
required  for  the  Ubrlp,  Ubc2p-dependent  multiubiquity  1- 
ation  of  N-end  rule  substrates,  and  that  Cysl  of  the  RING- 
H2  finger  is  a  functionally  critical  residue  of  this  domain. 

Dominant-negative  effect  of  the  ubiquityiation- 
inactive  RING-H2  mutant  of  Ubrlp 

The  earlier  gel  filtration  data  suggested  that  Ubrlp  functions 
as  a  monomer  (Bartel,  1990).  In  addition,  the  immunopre- 
cipitation  of  FLAG-Ubrlp  with  anti-FLAG  antibody  from 
extracts  of  cells  that  overexpressed  both  FLAG-Ubrlp  and 
Ubrlp-HA  did  not  co-immunoprecipitate  Ubrlp-HA,  also 
suggesting  the  monomeric  configuration  of  Ubrlp 
(A.Webster  and  A. Varshavsky,  unpublished  data).  We  asked 
whether  the  functionally  inactive  UbrlpMR1  could  exert  a 
dominant-negative  effect  on  the  N-end  rule  pathway.  Either 
wt  Ubrlp  or  UbrlpMR1  (in  addition  to  vector  alone)  was 
overexpressed  from  the  Padhi  promoter  on  a  high-copy 
plasmid  in  UBR1  cells  that  also  expressed  the  UPR- 
based  fusions  which  produced  Arg-Pgal  (or  Leu-pgal)  and 
equimolar  amounts  of  DHFR-UbR48,  the  reference  protein. 
Cells  were  labeled  with  [35S]methionine  for  30  min,  fol¬ 
lowed  by  immunoprecipitation  of  Leu-Pgal  (Figure  6A)  or 
Arg-Pgal  (Figure  6B),  as  well  as  the  reference  protein 
DHFR-UbR48.  The  ratios  of  X-pgal  to  DHFR-UbR48  were 
measured,  and  thereafter  normalized  by  the  same  ratios  in 
congenic  ubrlA  cells  (Figure  6A  and  B,  lane  1).  Whereas 
overexpresion  of  wt  Ubrlp  in  UBR1  cells  accelerated  the 
degradation  of  Leu-Pgal  and  Arg-Pgal,  overexpression  of 
UbrlpMR1  strongly  inhibited  this  degradation  (Figure  6A 
and  B,  lanes  2  and  3,  compare  with  lanes  4).  The  dominant- 


Fig.  6.  Dominant-negative  effect  of  UbrlpMR!,  the  RING-H2  mutant 
of  Ubrlp.  The  Padhi  promoter-based,  high-copy  vector  alone  (lane  2), 
and  its  derivatives  expressing  the  FLAG-tagged  wt  Ubrlp  (lane  3)  or 
UbrlpMR1  (lane  4)  were  transformed  into  JD52  ( UBR1 )  cells 
expressing  the  UPR-based  fusion  DHFRha-UbR48-Leu-Pgal  (A)  or 
DHFRha-UbR48-Arg-pgal  (B).  Cells  were  labeled  with 
[35S] methionine  for  30  min  at  30°C,  followed  by  extraction, 
immunoprecipitation  with  a  mixture  of  anti-pgal  and  anti-HA 
antibodies,  and  SDS-12%  PAGE,  and  quantitation.  Lanes  1  in  (A)  and 
(B)  show  the  results  of  the  same  tests  with  JD55  (ubrlA)  cells  in  the 
absence  of  any  version  of  Ubrlp.  The  measured  ratio  of  the  test 
protein  (Leu-pgal  or  Arg-pgal)  to  the  reference  protein  (DHFRha- 
UbR48)  in  lanes  1  was  set  at  1 .00.  The  ratios  determined  in  the  other 
settings  and  normalized  against  the  lane-1  ratios  are  shown  below  the 
lanes. 


negative  effect  of  overexpressed  UbrlpMR1  was  caused  at 
least  in  part  by  the  sequestration  of  endogenous  Ubc2p, 
inasmuch  as  this  effect  could  be  partially  reversed  through 
overexpression  of  Ubc2p  (data  not  shown).  The  sequestra¬ 
tion  of  N-end  rule  substrates  by  UbrlpMR1,  which  could 
bind  to  substrates  indistinguishably  from  wt  Ubrlp 
(Figure  5),  was  another  likely  (and  independent)  cause  of 
the  dominant-negative  effect  of  UbrlpMR1  (Figure  6).  The 
possibility  that  overexpression  of  UbrlpMR1  might  also 
interfere  with  the  downstream  components  of  the  Ub  system, 
such  as  the  26S  proteasome,  was  ruled  out  by  the  observation 
that  the  degradation  of  Ub-Pro-Pgal  was  not  affected  by 
the  overexpression  of  UbrlpMR1  (data  not  shown).  Ub- 
Pro-Pgal  is  a  short-lived  protein  degraded  by  the  UFD 
(Ub/fusion/degradation)  pathway,  which  targets  the  non¬ 
removable  (or  slowly  removable)  N-terminal  Ub  moiety  of 
a  fusion  protein  (Johnson  et  al. ,  1995;  Koegl  et  al. ,  1999). 

Discussion 

We  report  two  main  results.  First,  we  identified  and  dissected 
the  E2  enzyme-binding  site  of  Ubrlp,  the  E3  component 
of  Ub-protein  ligase  in  the  N-end  rule  pathway  of  S.cerevis- 
iae.  This  site,  the  BRR  domain,  is  shown  to  bind  the  Ubc2p 
Ub-conjugating  (E2)  enzyme  mainly  through  electrostatic 
interactions  between  the  basic  residues  of  BRR  and  the 
polyacidic  C-terminal  tail  of  Ubc2p.  Although  high-affinity 
interactions  between  the  E3  Ubrlp  and  the  E2  Ubc2p  are 
essential  for  normal  activity  of  the  N-end  rule  pathway, 
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even  much  weaker  interactions  between  mutant  variants  of 
Ubrlp  and  Ubc2p,  the  interactions  which  could  not  be 
detected  by  either  two-hybrid  or  pull-down  assays,  were 
found  to  be  sufficient  for  sustaining  a  large  fraction  of  the 
pathway’s  activity.  Secondly,  we  discovered  the  requirement 
for  the  RING-H2  finger  of  Ubrlp  in  the  ubiquitylation  of 
substrates  mediated  by  the  Ubrlp-Ubc2p  complex.  A  single 
Cys-^Ser  missense  mutation  in  the  RING-H2  finger  (which 
is  adjacent  to  the  BRR  domain  in  Ubrlp)  severely  impairs 
ubiquitylation  and  degradation  of  N-end  rule  substrates,  but 
does  not  impair  the  ability  of  Ubrlp  to  bind  the  substrates 
and  Ubc2p.  These  findings  defined  the  physical  and  func¬ 
tional  aspects  of  the  Ubc2p-Ubrlp  interaction,  and  revealed 
a  specific  essential  function  of  the  RING-H2  finger.  This 
domain  is  present  not  only  in  the  members  of  the  UBR 
family  of  E3s  (Kwon  et  al. ,  1998)  but  also  in  otherwise 
dissimilar  E3  proteins  of  the  Ub  system. 

Physical  and  functional  aspects  of  E2-E3  interaction 

Ubrlp,  the  225  kDa  E3  protein,  was  shown  to  interact  with 
the  20  kDa  Ubc2p  E2  enzyme  mainly,  if  not  exclusively, 
through  the  BRR  domain  of  <15  kDa.  This  interaction 
requires  the  polyacidic  C-terminal  tail  of  Ubc2p  and  has  a 
strong  electrostatic  component,  because  conversions  of 
single  basic  residues  in  the  BRR  domain  into  neutral  ones 
resulted  in  comparable  decreases  in  the  affinity  of  Ubrlp 
for  Ubc2p.  Moreover,  this  affinity  was  decreased  in  rough 
proportion  to  the  number  of  mutated  basic  residues  in  BRR 
(Figures  1  and  2).  These  results  are  consistent  with  the 
reported  dependence  of  the  Ubrlp-Ubc2p  interaction  on 
the  length  of  the  polyacidic  tail  of  Ubc2p.  Specifically,  the 
truncated  Ubc2p  derivative  that  lacked  eight  of  23  residues 
of  the  polyacidic  tail  retained  high  affinity  for  Ubrlp,  but 
further  shortening  of  the  Ubc2p  polyacidic  tail  strongly 
decreased  the  affinity  (Madura  et  al. ,  1993). 

Previous  work  suggested  that  the  9  residue  N-terminal 
region  of  Ubc2p  was  essential  for  the  binding  of  Ubc2p  to 
Ubrlp  (Watkins  et  al,  1993).  Our  current  data  demonstrated 
that  deletion  of  the  first  9  residues  of  Ubc2p  weakened  but 
did  not  eliminate  its  interaction,  in  the  two-hybrid  assay, 
with  the  287  residue  fragment  of  Ubrlp  that  contained  the 
BRR  domain  and  the  RING-H2  finger  (Figure  1).  It  is  not 
clear  how  the  N-terminal  9  residues  of  Ubc2p  contribute 
to  the  interaction  with  Ubrlp.  Inasmuch  as  Arg6  and  Arg8 
of  the  folded  Ubc2p  are  likely  to  interact  through  a  hydrogen 
bond  (Worthylake  et  al. ,  1998),  it  is  possible  that  the  effect 
of  the  9  residue  N-terminal  deletion  is  indirect,  stemming 
from  a  conformational  destabilization  of  Ubc2p.  Our  finding 
of  the  287  residue  Ubrlp  fragment  that  contains  the  BRR 
and  RING-H2  domains  and  is  sufficient  for  the  high-affinity 
binding  to  Ubc2p  (Figures  1  and  2)  should  facilitate  the 
dissection  of  this  E2-E3  interaction  at  the  atomic  level 
through  co-crystallization  of  the  previously  solved  Ubc2p 
(Worthylake  et  al ,  1998)  and  the  287  residue  fragment 
of  Ubrlp. 

Function  of  the  RING-H2  finger  of  Ubrlp 

One  of  the  major  results  of  this  work  is  the  discovery  of 
an  essential  and  specific  requirement  for  the  RING-H2 
finger  of  Ubrlp  in  the  ubiquitylation  of  N-end  rule  substrates 
mediated  by  the  Ubrlp-Ubc2p  complex.  The  RING-H2 
motif  is  a  distinct  variant  of  the  previously  defined  Cys/ 
His-rich  RING  motif.  In  several  proteins  outside  the  Ub 


system,  RING-H2  has  been  shown  to  function  as  a  site  ©f 
protein-protein  interactions  (Borden  and  Freemont,  1996; 
Inouye  et  al. ,  1997;  Kamura  et  al. ,  1999).  In  addition  to 
the  UBR  family  of  E3  proteins  (Kwon  et  al ,  1998),  several 
other  E3  components  of  the  Ub  system,  including  Ape  lip, 
Roclp  (Rbxlp  or  Hrtlp),  Roc2p  and  their  homologs  in 
multicellular  eukaryotes,  have  also  been  found  to  contain 
RING-H2  or  other  variants  of  RING  (Yu  et  al. ,  1998; 
Zachariae  et  al .,  1998;  Ohta  et  al .,  1999;  Seol  et  al.,  1999; 
Skowyra  et  al.,  1999;  Tan  et  al.,  1999). 

The  RING-H2  of  Ubrlp  is  adjacent  to  the  BRR  domain, 
which  mediates  the  physical  interaction  between  Ubrlp  and 
Ubc2p  (Figures  1  and  2).  A  single  Cys— >Ser  mutation 
at  position  1  of  the  RING-H2  finger  abolished  ubiquityl¬ 
ation  and  degradation  of  N-end  rule  substrates,  but  did 
not  impair  the  ability  of  the  resulting  mutant,  UbrlpMR1, 
to  bind  both  the  substrates  and  Ubc2p  (Figures  4-6).  The 
Cys— >Ser  mutations  at  other  positions  of  the  RING-H2 
finger  had  similar  but  partial  effects,  while  the  His— > Ala 
mutation  at  Hisl  position  of  RING-H2  had  the  smallest 
effect  among  the  mutants  tested  (Figures  4-6).  Thus,  even 
though  the  Ubc2p  E2  enzyme  can  interact  with  the  El 
enzyme  to  yield  the  Ubc2p~Ub  thioester,  and  can  also 
produce  isopeptide  bond-mediated  Ub-protein  conjugates 
in  the  absence  of  E3  (Jentsch  et  al.,  1987),  the  synthesis 
of  a  multi-Ub  chain  by  the  Ubc2p-Ubrlp  complex  requires 
the  presence  of  cysteine  at  position  1  of  the  RING-H2 
finger  in  Ubrlp. 

The  Ub  system  of  a  cell  contains  a  number  of  distinct 
E2  and  E3  proteins  (Hochstrasser,  1996;  Varshavsky,  1997; 
Hershko  and  Ciechanover,  1998).  An  E3  protein  of  a  Ub- 
dependent  proteolytic  pathway  was  initially  presumed  to 
function  as  a  non-catalytic,  substrate-recognition  compon¬ 
ent  of  the  corresponding  Ub-protein  ligase,  which  consists 
of  an  E3  and  its  associated  E2  enzyme.  Ubrlp  (N-recognin) 
of  the  present  work  was  the  first  E3  that  has  been 
shown  to  bind  proteins  bearing  specific  degradation  signals 
(Bachmair  et  al.,  1986;  Bartel  et  al.,  1990).  More  recently, 
it  was  found  that  in  addition  to  recognizing  specific 
substrates,  at  least  some  E3s,  specifically  those  that  contain 
the  HECT  domain  (Scheffner  et  al.,  1993),  also  function 
as  enzymes.  An  E3  of  this  class  can  accept  the  Ub  moiety 
from  an  associated  E2~Ub  thioester,  initially  forming  the 
E3~Ub  thioester  confined  to  a  specific  (and  essential) 
cysteine  of  E3.  The  Ub  moiety  of  this  E3~Ub  thioester  is 
thereafter  conjugated,  through  the  isopeptide  bond,  either 
directly  to  a  substrate  protein  or  to  the  Ub  moiety  of  a 
growing,  substrate-linked  multi-Ub  chain  (Scheffner  et  al., 
1995).  Other  classes  of  E3  proteins,  including  the  SCF, 
APC  and  UBR  families  (Hershko  and  Ciechanover,  1998; 
Kwon  et  al.,  1998;  Tyers  and  Willems,  1999),  lack 
recognizable  HECT  domains,  and  have  not  thus  far  been 
demonstrated  to  form  thioesters  with  Ub. 

The  mutations  tested  in  the  RING-H2  finger  of  Ubrlp 
(including  the  Cysl  mutation  which  abolished  the  ubi¬ 
quitylation  activity  of  the  Ubrlp-Ubc2p  complex)  did  not 
impair  the  ability  of  Ubrlp  to  bind  the  Ubc2p  enzyme 
and  N-end  rule  substrates  (Figures  1-5).  Thus,  the  specific 
functional  effect  of  these  mutations  cannot  be  due  to 
a  global  conformational  destabilization  of  Ubrlp.  We 
conclude  that  the  Ubrlp  (E3)  component  of  the  N-end 
rule  pathway  not  only  recognizes  specific  degradation 
signals  in  substrate  proteins,  but  in  addition  directly 
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participates,  through  a  mechanism  that  requires  the 
RING-H2  domain,  in  the  Ubc2p-mediated  synthesis  of  a 
substrate-linked  multi-Ub  chain.  In  the  first  of  possible 
models,  a  specific  cysteine  of  RING-H2,  perhaps  the 
essential  Cysl  residue  (Figure  1),  acts  as  an  acceptor  of 
the  Ub  moiety  transesterified  from  the  Ubc2p~Ub  thioester, 
similar  to  the  one  previously  described  mechanism  of  the 
HECT-domain  E3  called  E6AP  (Scheffner  et  al ,  1995; 
Huibregtse  et  al ,  1998).  Earlier  experiments  with  an 
in  vitro  ubiquitylation  system  containing  purified  S.cerevis- 
iae  Ubrlp  and  other  targeting  components  of  the  N-end 
rule  pathway  suggested  the  Ubc2p-dependent  formation 
of  a  Ubrlp~Ub  thioester  (V.Chau  and  A. Varshavsky, 
unpublished  data).  Attempts  to  rigorously  verify  the  func¬ 
tionally  relevant  presence  of  such  a  thioester  in  the 
completely  defined,  Ubrlp-based  in  vitro  system  have 
not  been  successful  thus  far  (F.Du  and  A. Varshavsky, 
unpublished  data).  Another  possible  role  of  the  cysteine(s) 
of  the  RING-H2  finger  in  Ubrlp  may  be  to  modify  the 
immediate  vicinity  of  the  active  cysteine  (Cys88)  of  the 
Ubrlp-associated  Ubc2p  in  a  way  that  facilitates  the  multi- 
Ub  synthesis  by  Ubc2p,  in  the  absence  of  Ubrlp~Ub 
thioester  formation.  Physical  proximity  between  the 
Ubc2p-binding  BRR  domain  of  Ubrlp  and  its  RING-H2 
domain  (Figure  1A)  is  consistent  with  this  mechanism.  A 
third  model  posits  the  existence  of  an  unidentified  factor 
that  binds  to  the  RING-H2  finger  of  Ubrlp,  and  is  required 
for  the  efficient  synthesis  of  multi-Ub  by  the  Ubrlp- 
Ubc2p  complex.  Jentsch  and  colleagues  (Koegl  et  al , 
1999)  have  identified  a  factor,  termed  E4,  which  facilitates 
the  synthesis  of  a  multi-Ub  chain  linked  to  a  substrate  of 
the  S.cerevisiae  UFD  pathway.  This  factor,  encoded  by 
the  UFD2  gene,  is  required  for  the  in  vivo  degradation  of 
UbV76-Val-(3gal,  but  is  not  required  for  the  degradation  of 
a  much  smaller  and  monomeric  UFD  substrate  such  as 
Ubv76-Val-DF[FR;  Ufd2p  is  also  not  required  for  the 
degradation  of  pgal-based  N-end  rule  substrates  (Johnson 
et  al ,  1995).  Whether  the  function  of  the  RING-H2  finger 
of  Ubrlp  is  mediated  by  a  finger-binding  protein  analogous 
to  E4  remains  to  be  determined. 

The  RING-H2  finger  and  its  variants  are  present  in 
several  otherwise  dissimilar  classes  of  E3  proteins  in  the 
Ub  system.  For  example,  the  mammalian  E3  encoded  by 
the  Mdm2  gene  and  specific  for  the  p53  substrate  contains 
a  putative  RING  finger  (Elenbaas  et  al,  1996);  the 
conserved  Cys464  residue  of  this  domain  was  proposed 
to  define  an  active  site  analogous  to  the  Ub  thioester- 
forming  site  of  the  HECT-domain  E3  enzymes  (Honda 
et  al,  1997).  Roclp  (Rbxlp,  Hrtlp),  a  recently  identified 
subunit  of  the  cell  cycle-regulating  Ub-protein  ligase 
complex  called  SCF,  contains  a  variant  of  RING-H2  in 
which  Cys6  is  replaced  by  Asp  (Kamura  et  al,  1999). 
Mutagenesis-based  analyses  suggested  that  the  RING 
finger  of  Roclp  was  required  for  the  function  of  the  SCF 
complex  (Kamura  et  al,  1999;  Ohta  et  al,  1999).  The 
sequence  of  Roclp  (Rbxlp,  Hrtlp)  is  highly  similar  to 
that  of  Ape  lip,  the  RING-H2-containing  subunit  of 
another  cell  cycle-regulating  Ub-protein  ligase,  called 
APC  or  cyclosome  (Yu  et  al,  1998;  Zachariae  et  al, 
1998).  The  RING-H2  finger  and  its  variants  have  also 
been  found  in  other  (possibly  also  E3)  components  of  the 
Ub  system,  including  Siah-1  and  Der3p  (Hrdlp),  which 
are  required,  respectively,  for  the  degradation  of  the  DCC 


protein  (deleted  in  colorectal  cancer)  (Hu  and  Fearon, 
1999)  and  of  the  misfolded  lumenal  and  integral  membrane 
proteins  (Bordallo  et  al,  1998). 

Further  dissection  of  the  RING  domains  in  these  and 
other  E3  proteins  will  determine  whether  the  ubi- 
quitylation-enabling  function  of  RING-H2  in  Ubrlp  of 
the  N-end  rule  pathway  is  a  general  property  of  the  E3 
components  in  the  Ub  system. 

Materials  and  methods 

Strains ,  media  and  p-gaiactosidase  assay 

The  S.cerevisiae  strains  used  were  JD55  (MATa  trpl-A63  ura3-52  his3- 
A 200  leu2-3,  112  lys2-801  ubrlA::HlS3)\  JD52  (MATa  trpl-A63  ura3- 
52  his3-A200  leu2~3 ,  112  lys2-801 );  AVY301  (a  ubc2A::VRA3  derivative 
of  JD55);  and  PJ69-4A  (a  gift  from  P.James  and  E.Craig)  (MATa  trpl-109 
leu2-3,  112  ura3-52  his3-A200  gal4A  gal80A  GAL2-ADE2  LYS2::GAL1- 
H1S3  met::GAL7-lacZ).  Cultures  were  grown  in  rich  (YPD)  or  in 
synthetic  media  containing  standard  ingredients  (Sherman,  1991)  and 
either  2%  glucose  (SD  medium),  2%  galactose  (SG  medium)  or  2% 
raffinose.  Yeast  were  transformed  as  described  previously  (Chen  et  al., 
1992).  For  the  induction  of  the  Pcupi  promoter,  CuS04  was  added  to  a 
final  concentration  of  0.1  mM  (Ghislain  et  at.,  1996).  The  enzymatic 
activity  of  pgal  in  yeast  extracts,  from  cultures  at  OD60o  of  0.8-1. 0,  was 
determined  as  described  previously  (Baker  and  Varshavsky,  1991),  using 
fl-nitrophenyl-p-D-galactopyranoside  (ONPG).  The  Escherichia  coli 
strain  used  was  DH5a. 

Uses  of  the  two-hybrid  system 

The  UBC2  ORF  was  amplified  by  PCR  using  pKM1313  (Madura  and 
Varshavsky,  1994)  as  the  template  DNA.  The  two  primers  were  designed 
so  that  the  PCR  products  contained  a  Ncol  site  immediately  5'  to  the 
start  codon,  and  a  Sail  site  immediately  3'  to  the  stop  codon.  The  Ncol - 
Sall-cut  PCR  fragment  was  subcloned  into  Ncol-Sall-cut  PAS2  (Durfee 
et  al.,  1993),  yielding  pAS2-UBC2,  which  expressed  a  fusion  containing 
the  DNA-binding  domain  of  Gal4p  upstream  of  the  Ubc2p  moiety. 
The  construct  was  verified  by  nucleotide  sequencing,  and  also  by 
immunoblotting,  in  which  the  Gal4-Ubc2p  protein  was  detected  with  an 
anti-Gal4  antibody  (Santa  Cruz  Biotechnology,  Santa  Cruz,  CA).  Gal4- 
Ubc2p  retained  the  ability  of  Ubc2p  to  rescue  the  UV-sensitivity  defect 
of  ubc2A  cells.  The  PCR  primers  used  were  ggaaccatggagatgtccacacca- 
getagaag  and  ccttgtcgactcagtctgcttcgtcgtcgt.  The  two-hybrid  DNA  librar¬ 
ies  and  the  host  S.cerevisiae  strain  PJ69-4A  were  gifts  from  P.James  and 
E.Craig  (James  et  al. ,  1996).  PJ69-4A  expressed  three  reporters 
from  different  inducible  promoters:  Pgal \-NlS3,  ^q/^2-ADE2,  and 
Pgal7 -lacZ.  pAS2-UBC2  was  transformed  into  PJ69-4A.  Trp+  trans¬ 
formants  were  then  transformed  with  the  two-hybrid  libraries,  with 
selection  directly  on  SD  (-trp,  -leu,  -ade)  plates  at  30°C.  Ade+  colonies 
were  then  tested  on  SD  (-trp,  -leu,  -his)  plates.  Library  plasmids  were 
rescued  from  the  Ade+  His+  transformants,  and  re-transformed  into 
PJ69-4A  expressing  either  pAS2-UBC2  or  the  control  vector  pAS2.  The 
plasmids  that  induced  ADE2  and  H1S3  in  the  presence  but  not  in  the 
absence  of  pAS2-UBC2-encoded  putative  Ubc2p-interacting  proteins, 
and  were  analyzed  by  partial  sequencing.  One  clone  thus  identified 
encoded  a  287  residue  fragment  (position  1134-1420)  of  the  ORF 
YLR024C,  which  was  previously  termed  UBR2  (Hochstrasser,  1996; 
Kwon  et  al.,  1998),  as  described  in  Results. 

The  two-hybrid  system  was  also  used  to  qualitatively  assess  the 
interaction  of  either  Ubc2p  or  its  truncated  derivatives  with  the  BRR 
domain  and/or  the  RING-H2  finger  of  either  wt  Ubrlp  or  its  derivatives 
containing  point  mutations.  UBC2I_}49,  encoding  the  C-terminally  trun¬ 
cated  Ubc2p  lacking  the  23  residue  polyacidic  tail,  was  constructed 
using  PCR  and  the  primers  ggaaccatggagatgtccacaccagctagaag  and 
gctagtcgacccaagatttctctaccgtctc.  The  PCR  products  were  cut  by  Sail  and 
BamUl,  and  subcloned  into  Sa/I-Zto/nHI-cut  pGBDUCl  (James  et  al., 
1996),  yielding  pGBDUCl UBC2M49,  which  expressed  a  fusion  of 
Ubc2p!_149  to  the  DNA  binding  domain  of  Gal4p.  Similarly, 
UBC2 io-i72 .  encoding  Ubc2p  lacking  the  first  9  residues,  was  constructed 
using  the  primers  tacgggatccatgagagattttaaacgtatg  and  ccttgtcgactcagtctg¬ 
cttcgtcgtcgt.  The  PCR  products  were  digested  with  Sail  and  BamUl,  and 
subcloned  into  Sall-BamUl-cuX  pGBDUCl  vector,  yielding  pGBDU¬ 
Cl  UBC2j 0_ i72-  The  UBR1  fragment  encoding  residues  1081-1367, 
including  the  BRR  domain  and  the  RING-H2  finger,  was  produced 
using  PCR  and  the  primers  aactggatccggtaaggactctataccagaagct  and 
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ttccgtcgacctcagtaaaaggtttgaagagcct.  The  PCR  products  were  cut  with 
BamHl  and  Sail,  and  subcloned  into  BamHl-Sall-cut  pGADCl  (James 
et  al,  1996),  yielding  pGADC  1  UBR  l108i-i367>  which  encoded  a  fusion 
of  Ubrlpio8i_i367  to  the  transcriptional  activation  domain  of  Gal4p. 
UBRl  108I-122O’  encoding  the  Ubrlp  fragment  containing  the  BRR  domain 
but  not  the  RING-H2  finger,  was  constructed  using  the  primers 
tacgggatccatgagagattttaaacgtatg  and  ttccgtcgacgcaggtaaaatcctccgattc,  the 
above  procedure,  and  yielding  pGADC  1  UBR  l]08i~i  220  which  expressed 
a  fusion  of  Ubrlpio8i~i220  to  the  transcriptional  activation  domain  of 
Gal4.  UBRl }177_j fa?,  encoding  the  Ubrlp  fragment  containing  the 
RING-H2  finger  but  not  the  BRR  domain,  was  constructed  using  the 
primers  aactggatccgcaaggcttctagccaag  and  ttccgtcgacgcaggtaaaatcctcc- 
gattc,  the  above  procedure,  and  yielding  pGADC  lUBRlj^.^,  which 
expressed  a  fusion  of  Ubrlp!  177_i367  to  the  transcriptional  activation 
domain  of  Gal4.  UBRl  alleles  containing  point  mutations  in  either  the 
BRR  domain  or  the  RING-H2  finger  were  produced  using  the 
QuickChange  mutagenesis  kit  (Stratagene,  La  Jolla,  CA).  Briefly,  an 
-850  bp  EcoRl-Bglll  fragment  from  pGADClUBRl108i-i367  that 
encoded  the  BRR  domain  and  the  RING-H2  finger  was  subcloned  into 
the  £c<?RI-Bg/II-cut  LITMUS28  vector  (New  England  Biolabs,  Beverly, 
MA),  yielding  LITMUS28UBRl(£’ce>RI-£g/II),  which  was  subjected  to 
site-directed  mutagenesis.  The  resulting  plasmids  were  cut  with  EcoRl 
and  Bglll,  and  the  -850  bp  EcoRl-Bglll  fragments  containing  the 
intended  point  mutations  were  subcloned  back  into  £coRI-Bg/II-cut 
pGADClUBRl108i_i367?  yielding  pGADClUBRIMB  10,  pGADC  1UBR- 
1MB11,  pGADClUBRlMB17,  pGADClUBRlMB4,  pGADClUBRl- 
MB5,  and  PGADC1UBR1MB6;  pGADC  1 UBR 1  MR  1 ,  pGADClUB- 
R1MR2,  pGADC  1  UBR  1MR3,  pGADClUBRlMR12,  pGADClUBRl- 
MR1 8,  pGADC  1  UBR  1 MR7  and  pGADC  1UBR1MR8.  All  PCR  products 
and  inserts  containing  point  mutations  were  verified  by  DNA  sequencing 
on  both  strands.  These  mutants  belonged  to  either  the  MB  or  the  MR 
groups  (Figure  IB). 

UPR  plasmids  encoding  N-end  rule  substrates 

UPR  plasmids  were  the  yeast  vector-based  counterparts  of  the  plasmids 
originally  constructed  for  expression  in  mammalian  cells  (Levy  et  al., 
1996).  These  plasmids,  termed  pRpgalUPR  and  pLpgalUPR,  expressed, 
respectively,  DHFR-Ub- Arg-  pgal  and  DHFR-Ub-Leu-pgal  from  the 
PCupi  promoter.  A  BainHl-Smal  insert  of  pBARUPR  or  pBALUPR  was 
replaced  with  a  3.8  kb  Pgal-encoding  insert  from  pUB23-M  (Bachmair 
et  al.,  1986),  using  BamHl  and  Seal.  pBARUPR  and  pBALUPR  were 
derived  from  the  low-copy,  pRS314-based  pBAM  vector  (Ghislain  et  al., 
1996).  They  expressed,  respectively,  DHFR-Ub-Arg-Ura3p  and  DHFR- 
Ub~Leu-Ura3p  from  the  Pcupi  promoter  (A. Webster  and  A. Varshavsky, 
unpublished  data),  and  were  a  gift  from  A. Webster. 

Plasmids  encoding  Ubrlp ,  Ubc2p  and  their  mutant 
derivatives 

UBRl  and  its  mutant  alleles  were  expressed  either  from  its  natural 
promoter  in  a  low-copy,  pRS315-derived  vector  or  from  the  Padhi 
promoter  in  a  high-copy,  YEplacl81-based  plasmid.  Briefly,  pRS315 
(Sikorski  and  Hieter,  1989)  was  cut  with  Xbal,  blunted  by  Klenow  Pol  I, 
and  thereafter  self-ligated,  yielding  pRS3\5(-XbaT)  lacking  the  Xbal 
site.  pUBRl  (Bartel  et  al,  1990)  was  cut  with  Sail  and  Pstl.  An  -8.1  kb 
insert  containing  the  ORF  and  promoter  of  UBRl  was  subcloned 
into  Sall-Pstl-cul  pRS315(-AM),  yielding  pRS315(-XM)UBRl.  To 
produce  mutant  ubrl  alleles,  a  -2.9  kb  BgFll-Xbal  fragment  of  UBRl 
from  pRS315(-XM)UBRl  was  subcloned  into  LITMUS28.  The 
resulting  LITMUS28UBRl(£g/II-A2?aI)  was  digested  with  Hpal  and 
Sapl,  and  an  -1.4  kb  fragment  was  isolated.  LITMUS28UBRl(Rg/II- 
Xbal)  was  also  cut  by  Sapl,  and  an  -3.7  kb  band  was  isolated.  The  1.4 
and  3.7  kb  fragments  were  ligated  with  a  set  of  -600  bp  Sapl-Hpal 
fragments  from  the  point  mutation-containing  pGADC  1  UBR  1MB4, 
pGADC  1 UB  R 1  MB  6,  pGADC  1  UBR  1  MR  1,  pGADC  1  UBR  1MR2, 
pGADC  1 UB R 1 MR3 ,  pGADC  1  UBR  1  MR  12  or  pGADClUBRlMR18, 
yielding,  respectively,  pLITSUM28UBRlMB4,  pLITMUS28UBRl- 
MB6,  pLITMUS28UBRlMRl,  pLITMUS28UBRlMR2,  pLITMUS28- 
UBR1MR3,  pLITMUS28UBR  1  MR  1 2  and  pLITMUS28UBRlMR18.  A 
2.9  kb  BgRl-Xbal  fragment  from  each  of  these  plasmids  was  subcloned 
into  Bgtll-Xbal-c\x\  pRS3 1 5(-X/?aI)UBR  1 ,  yielding,  respectively, 
pRS315(-AM0UBRlMB4,  pRS315(-AM)UBRlMB6,  pRS315 
(-XM)UBRIMRI,  pRS3 1 5 (-XfozI)UBR  1 MR2,  pRS315(-XM)UBRl- 
MR3,  pRS 3 1 5 (-Xbal)UB R 1  MR  1 2  and  pRS315(-Xfod)UBRlMR18. 
pNTFlagUBRl  expressed  the  full-length  Ubrlp  bearing  the  N-terminal 
FLAG  epitope  (MDYKDDDDK)  from  the  PAdhi  promoter  in  a  high- 
copy,  LEU2-containing  vector.  The  MB6,  MR1  or  MR2  mutations 
(Figure  IB)  were  introduced  into  pNTFlagUBRl  by  replacing  its  2.9  kb 


Bglll-Xbal  insert  with  its  counterparts  from  pRS315(-XfotI)UBRlMB6, 
pRS315(~AM)UBRlMRl  or  pRS315(-XM)UBRlMR2,  yielding, 
respectively,  pNTFlagUBRl  MB  6,  pNTFlagUBRl  MR  1  and  pNTFlag- 
UBR1MR2. 

UBC2  and  UBC2}_ }49  alleles  containing  the  native  Pubc2  promoter 
(-500  bp  of  the  5'  region)  were  amplified  by  PCR.  The  SacI-SM-cut 
PCR  products  were  subcloned  into  Sacl-SaB-cut  p413GALl  (Mumberg 
et  al.,  1994).  Both  wt  Ubc2p  and  Ubc2p1_l49  expressed  from  these 
vectors  were  able  to  complement  the  UV-sensitivity  phenotype  of 
ubc2A  cells  (data  not  shown).  The  PCR  primers  for  the  wt  UBC2  and 
the  ubc2]_}49  were,  respectively,  acgtgagctcccctaattgaattgttaagcg  and 
ccttgtcgactcagtctgcttcgtcgtcgt,  and  acgtgagctcccctaattgaattgttaagcg  and 
gctagtcgactcaccaagatttctctaccgtctc. 

GST  binding  assays 

The  purified  GST-Ubc2p  fusion  protein  (-1  pg)  was  diluted  into  0.5  ml 
of  the  loading  buffer  (1%  Triton  X-100,  10%  glycerol,  0.5  M  NaCl, 
1  mM  EDTA,  50  mM  Tris-HCl  pH  8.0),  incubated  with  10  pi  (bed 
volume)  of  the  glutathione-agarose  beads  (Sigma,  St  Louis,  MO)  for 
1  h  at  4°C,  and  washed  three  times  with  1  ml  of  the  binding  buffer 
(10%  glycerol,  50  mM  NaCl,  50  mM  Na-HEPES  pH  7.5).  Washed 
beads  were  incubated  with  extracts  from  35S-labeled  cells 
(-5  X  106  CCl3COOH-insoluble  counts/min/extract)  containing  the  vec¬ 
tor  alone  or  a  plasmid  expressing  one  of  the  UBR1  alleles,  including  wt 
FLAG-Ubrlp  (specifically,  pNTFlagUBRl,  pNTFlagUBRl  MB  6, 
pNTFlagUBRIMRl  or  pNTFlagUBRl MR2)  at  4°C  for  1  h.  The  incuba¬ 
tion  buffer  contained  1  mg/ml  of  the  ovalbumin  carrier  (Sigma,  St  Louis, 
MO)  in  the  binding  buffer.  Beads  were  washed  three  times  with  the 
binding  buffer.  Bound  proteins  were  eluted  with  the  loading  buffer 
containing  2%  SDS  and  fractionated  by  SDS-8%  PAGE. 

Co-immun  oprecipita  tion-immun  oblotting 

Ubc2p  bearing  the  N-terminal  HA  epitope  was  expressed  from  the  Pgali 
promoter  in  a  low-copy  number  vector  p416GALl  (Mumberg  et  al, 
1994).  AVY301  cells  (ubrl A  ubc2A)  containing  a  plasmid  expressing 
HA-tagged  Ubc2p  or  a  plasmid  expressing  FLAG-Ubrlp,  or  plasmids 
expressing  HA-tagged  Ubc2p  and  one  of  the  UBR1  alleles,  including 
pNTFlagUBRl,  pNTFlagUBRl  MB  6,  pNTFlagUBRIMRl  or  pNTFlag¬ 
UBRl  MR2  were  cultured  in  raffinose  medium  to  an  OD600  of  0.8-1. 0, 
then  switched  to  galactose  medium  for  induction  of  Ubc2p  expression 
for  6  h.  Cells  were  harvested  and  resuspended  in  DB  buffer  (50  mM 
NaCl,  1  mM  EDTA,  50  mM  Na-HEPES  pH  7.5)  containing  1 X  protease 
inhibitor  mix  (Boehringer  Mannheim),  and  thereafter  lysed  by  vortexing 
with  glass  beads  (Bartel  et  al,  1990).  For  the  co-immunoprecipitation 
of  Ubc2p  with  Ubrlp  (Figure  2B,  top  panel),  supernatants  were  incubated 
with  anti-HA  antiserum  (Babco,  Berkeley,  CA),  followed  by  SDS-8% 
PAGE  and  immunoblotting  with  a  monoclonal  anti-FLAG  antibody 
(Sigma,  St  Louis,  MO).  The  expression  levels  of  Ubrlp  and  Ubc2p  in 
these  transformants  were  monitored  by  immunoblotting  with  anti-FLAG 
antibody  (Figure  2B,  middle  panel)  and  anti-HA  antibody  (Figure  2B, 
bottom  panel),  respectively.  SDS-8%  PAGE  was  used. 

Metabolic  labeling  and  immunoprecipitation 

Saccharomyces  cerevisiae  cells  from  10  ml  cultures  (ODgoo  0.8-1. 2)  in 
SD  media  containing  0.1  mM  CuS04  were  labeled  for  30  min  at  30°C 
with  0.15  mCi  of  [35S] methionine/cysteine  (EXPRESS,  New  England 
Nuclear,  Boston,  MA).  The  same  conditions  were  used  to  label  FLAG- 
Ubrlp  and  its  derivatives  for  the  GST  pull-down  assays,  except  that  no 
CuS04  was  added.  To  assay  the  in  vivo  ubiquitylation  of  N-end  rule 
substrates,  cells  were  labeled  for  6  min,  followed  by  the  addition  of  2  M 
NEM  in  dimethylformamide  to  a  final  concentration  of  50  mM  (Johnsson 
and  Varshavsky,  1994).  Two  methods  were  used  to  prepare  cell  extracts. 
For  the  GST  pull-down  assays,  cell  extracts  were  prepared  as  described 
above  for  co-immunoprecipitation  immunoblots.  For  the  experiments 
shown  in  Figures  3C,  4B,  5A  and  6,  labeled  cells  were  pelleted,  and 
resuspended  in  0.8  ml  of  buffer  A  (1%  Triton  X-100,  0.15  M  NaCl, 

1  mM  EDTA,  50  mM  Na-HEPES  pH  7.5)  containing  protease  inhibitors, 
and  were  lysed  as  above.  The  extracts  were  centrifuged  at  12  000  g  for 
10  min,  and  supernatants  containing  equal  amounts  of  CCl3COOH- 
insoluble  35S  were  used  for  the  GST  assays  described  above  or  for 
immunoprecipitations.  To  detect  Ubrlp  (Figures  2 A  and  5 A,  lower 
panels),  immunoprecipitations  were  carried  out  with  a  monoclonal  anti- 
FLAG  antibody  conjugated  to  agarose  beads  (Kodak,  Rochester,  NY), 
followed  by  SDS-8%  PAGE.  For  the  co-immunoprecipitation  of  Leu- 
pgal  with  Ubrlp  (Figure  5 A,  top  panel),  supernatants  were  incubated 
with  a  monoclonal  anti-pgal  antibody  (Promega,  Madison,  WI),  followed 
by  SDS-8%  PAGE.  For  measurements  of  X-pgal  stability  using  the 
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UBR  technique  (Figures  3C,  4B  and  6),  a  mixture  of  an  anti-HA 
antiserum  (Babco,  Berkeley,  CA)  and  the  monoclonal  anti-pgal  antibody 
was  incubated  with  35S-labeled  cell  extracts.  Immunoprecipitated  proteins 
were  separated  by  SDS-12%  PAGE.  For  detection  of  the  in  vivo 
ubiquitylation,  cells  labeled  for  6  min  were  lysed  using  the  SDS-boiling 
method  (Hochstrasser  and  Varshavsky,  1990).  Briefly,  a  pellet  of  NEM- 
treated  cells  (see  above)  was  mixed  with  an  equal  volume  of  the  2X 
SDS  buffer  at  100°C  (2%  SDS,  30  mM  dithiothreitol,  90  mM  Na- 
HEPES  pH  7.5)  plus  50  mM  NEM,  and  incubated  at  100°C  for  5  min. 
The  extracts  were  diluted  10-fold  with  buffer  A  plus  protease  inhibitors 
and  50  mM  NEM.  CCl3COOH-insoluble  35S  was  determined,  and 
samples  containing  equal  amounts  of  35S  were  immunoprecipitated  with 
the  monoclonal  anti-pgal  antibody,  followed  by  SDS-6%  PAGE. 
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The  N-end  rule  relates  the  in  vivo  half-life  of  a  protein 
to  the  identity  of  its  N-terminal  residue.  Ubrlp,  the  rec¬ 
ognition  (E3)  component  of  the  Saccharomyces  cerevi - 
siae  N-end  rule  pathway,  contains  at  least  two  substrate¬ 
binding  sites.  The  type  1  site  is  specific  for  N-terminal 
basic  residues  Arg,  Lys,  and  His.  The  type  2  site  is  spe¬ 
cific  for  N-terminal  bulky  hydrophobic  residues  Phe, 
Leu,  Trp,  Tyr,  and  lie.  Previous  work  has  shown  that 
dipeptides  bearing  either  type  1  or  type  2  N-terminal 
residues  act  as  weak  but  specific  inhibitors  of  the  N-end 
rule  pathway.  We  took  advantage  of  the  two-site  archi¬ 
tecture  of  Ubrlp  to  explore  the  feasibility  of  bivalent 
N-end  rule  inhibitors,  whose  expected  higher  efficacy 
would  result  from  higher  affinity  of  the  cooperative  (bi¬ 
valent)  binding  to  Ubrlp.  The  inhibitor  comprised 
mixed  tetramers  of  j3-galaetosidase  that  bore  both  N- 
terminal  Arg  (type  1  residue)  and  N-terminal  Leu  (type  2 
residue)  but  that  were  resistant  to  proteolysis  in  vivo . 
Expression  of  these  constructs  in  S.  cerevisiae  inhibited 
the  N-end  rule  pathway  much  more  strongly  than  the 
expression  of  otherwise  identical  /3-galactosidase  tet¬ 
ramers  whose  N-terminal  residues  were  exclusively  Arg 
or  exclusively  Leu.  In  addition  to  demonstrating  spatial 
proximity  between  the  type  1  and  type  2  substrate-bind¬ 
ing  sites  of  Ubrlp,  these  results  provide  a  route  to  high 
affinity  inhibitors  of  the  N-end  rule  pathway. 


Among  the  targets  of  the  N-end  rule  pathway  are  intracel¬ 
lular  proteins  bearing  destabilizing  N-terminal  residues  (1,  2). 
This  proteolytic  pathway  is  one  of  several  pathways  of  the 
ubiquitin  (Ub)1  system,  whose  diverse  functions  include  the 
regulation  of  cell  growth,  division,  differentiation,  and  re¬ 
sponses  to  stress  (3-6).  Ub  is  a  76-residue  eukaryotic  protein 
that  exists  in  cells  either  free  or  conjugated  to  other  proteins. 
Many  of  the  Ub-dependent  regulatory  circuits  involve  proces- 
sive  degradation  of  ubiquitylated  proteins  by  the  26  S  protea- 
some,  an  ATP-dependent  multisubunit  protease  (7,  8). 

The  N-end  rule  is  organized  hierarchically.  In  the  yeast 
Saccharomyces  cerevisiae ,  Asn  and  Gin  are  tertiary  destabiliz¬ 
ing  N-terminal  residues  in  that  they  function  through  their 
conversion,  by  the  ArTAJ -encoded  N-terminal  amidase,  into  the 
secondary  destabilizing  N-terminal  residues  Asp  and  Glu.  The 
destabilizing  activity  of  N-terminal  Asp  and  Glu  requires  their 
conjugation  by  the  A  77?7 -encoded  Arg-tRNA-protein  transfer¬ 
ase  (R-transferase)  to  Arg,  one  of  the  primary  destabilizing 
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residues  (reviewed  in  Refs.  1  and  9).  In  mammals,  two  distinct 
N-terminal  amidases  specific,  respectively,  for  N-terminal  Asn 
or  Gin  mediate  the  conversion  of  these  tertiary  destabilizing 
residues  into  the  secondary  destabilizing  residues  Asp  or  Glu 
(10,  11).  The  set  of  secondary  destabilizing  residues  in  verte¬ 
brates  contains  not  only  Asp  and  Glu  but  also  Cys,  which  is  a 
stabilizing  residue  in  yeast  (9,  12,  13). 

The  primary  destabilizing  N-terminal  residues  are  bound 
directly  by  N-recognin,  the  E3  (recognition)  component  of  the 
N-end  rule  pathway.  In  S.  cerevisiae,  N-recognin  is  the  UBR1- 
encoded  225-kDa  protein  that  binds  to  potential  N-end  rule 
substrates  through  their  primary  destabilizing  N-terminal  res¬ 
idues:  Phe,  Leu,  Trp,  Tyr,  lie,  Arg,  Lys,  and  His  (1,  14).  The 
Ubrl  genes  encoding  mouse  and  human  N-recognin  (also  called 
E3a)  have  been  cloned  as  well  (15).  N-recognin  has  at  least  two 
substrate-binding  sites.  The  type  1  site  is  specific  for  the  basic 
N-terminal  residues  Arg,  Lys,  and  His.  The  type  2  site  is 
specific  for  the  bulky  hydrophobic  N-terminal  residues  Phe, 
Leu,  Trp,  Tyr,  and  lie  (1,  12, 16,  17).  N-recognin  can  also  target 
short-lived  proteins  such  as  Cup9p  (18)  and  Gpalp  (19,  20), 
which  lack  destabilizing  N-terminal  residues.  The  Ubrlp-rec- 
ognized  degradation  signals  of  these  proteins  remain  to  be 
characterized  in  detail. 

The  known  functions  of  the  N-end  rule  pathway  include  the 
control  of  di-  and  tripeptide  import  in  S.  cerevisiae  through  the 
degradation  of  Cup9p,  a  transcriptional  repressor  of  the  pep¬ 
tide  transporter  gene  PTR2  (18,  21);  a  mechanistically  unde¬ 
fined  role  in  the  Slnlp-dependent  phosphorylation  cascade  that 
mediates  osmoregulation  in  S.  cerevisiae  (22);  the  degradation 
of  Gpalp,  a  Ga  protein  of  S.  cerevisiae  (19,  20);  and  the  condi¬ 
tional  degradation  of  alphaviral  RNA  polymerase  in  virus- 
infected  metazoan  cells  (23).  Physiological  N-end  rule  sub¬ 
strates  were  also  identified  among  the  proteins  secreted  into 
the  cytosol  of  the  host  cell  by  intracellular  parasites  such  as  the 
bacterium  Listeria  monocytogenes  (24).  Short  half-lives  of  these 
proteins  are  required  for  the  efficient  presentation  of  their 
peptides  to  the  immune  system  (24).  A  partial  inhibition  of  the 
N-end  rule  pathway  was  reported  to  interfere  with  mammalian 
cell  differentiation  (25)  and  to  delay  limb  regeneration  in  am¬ 
phibians  (26).  Recent  evidence  suggests  that  the  N-end  rule 
pathway  mediates  a  large  fraction  of  the  muscle  protein  turn¬ 
over  (27)  and  plays  a  role  in  catabolic  states  that  result  in 
muscle  atrophy  (28). 

Targeted  mutagenesis  has  been  used  to  inactivate  the  N-end 
rule  pathway  in  Escherichia  coli  and  S.  cerevisiae  (14,  29). 
Analogous  mutants  have  recently  been  constructed  in  the 
mouse  as  well.2  These  approaches  notwithstanding,  an  effica¬ 
cious  inhibitor  of  the  N-end  rule  pathway  would  be  useful  as 
well,  especially  with  organisms  less  tractable  genetically.  The 
emerging  understanding  of  the  N-end  rule  pathway  in  mam¬ 
mals  suggests  that  selective  inhibition  or  activation  of  this 
proteolytic  system  may  also  have  medical  applications.  Previ- 


2  Y.  T.  Kwon  and  A.  Varshavsky,  unpublished  data. 
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ous  work  has  shown  that  millimolar  concentrations  of  amino 
acid  derivatives  such  as  dipeptides  bearing  destabilizing  N- 
terminal  residues  can  selectively  inhibit  the  N-end  rule  path¬ 
way  in  extracts  from  rabbit  reticulocytes  (12,  17)  and  Xenopus 
eggs  (13),  and  in  intact  S.  cerevisiae  cells  as  well  (16).  However, 
the  same  dipeptides  were  observed  to  have  at  most  marginal 
effects  on  the  N-end  rule  pathway  in  intact  mammalian  cells.3 
One  limitation  of  dipeptide  inhibitors  is  their  apparently  low 
affinity  for  the  type  1  and  the  type  2  site  of  N-recognin  (30). 

In  the  present  work,  we  explored  the  possibility  that  a  biva¬ 
lent  ligand  can  bind  simultaneously  to  the  type  1  and  type  2 
sites  of  N-recognin  (see  Fig.  1 A).  Similarly  to  the  previously 
characterized  bivalent  interactions  that  involve  either  macro¬ 
molecules  or  small  molecules  (31,  32),  the  cooperativity  of  bind¬ 
ing  at  two  independent,  mutually  nonexclusive  sites  would  be 
expected  to  increase  the  affinity  between  N-recognin  and  a 
bivalent  inhibitor  by  orders  of  magnitude,  in  comparison  with 
the  affinity  of  a  monovalent  binding  by  the  same  compound.  We 
show  that  a  bivalent  inhibitor  of  the  N-end  rule  pathway  is 
feasible  and  consider  the  implications  of  this  advance. 

EXPERIMENTAL  PROCEDURES 

Strains  and  General  Techniques— The  S.  cerevisiae  strains  used  were 
JD52  (. MATa  ura3-52  his3-A200  leu2-3,112  trpl-A63  lys2-801)  and 
JD55  t MATa  ura3-52  his3-A200  Ieu2-3,U2  trpl-A63  lys2-801  ubrl 
A::  HI  S3)  (19,  33).  Cells  were  grown  on  rich  (YPD)  or  synthetic  medium 
containing  either  2%  dextrose  (SD  medium),  2%  galactose  (SG  medi¬ 
um),  or  2%  raffinose  (SR  medium)  (34).  To  induce  the  Pcupi  promoter, 
CuS04  was  added  to  a  final  concentration  of  0.1  mM.  Transformation  of 
S.  cerevisiae  was  carried  out  using  the  lithium  acetate  method  (35). 

Plasmids — The  high  copy  (2^-based)  plasmids  pRA-0gal-TRPl  and 
pRA-j3gal-HIS3,  which  expressed  Arg-eAK-/3gal  (Ub-Arg-eAK-0gal)  (see 
Fig.  2A)  from  the  galactose-inducible  PCyci/gali  hybrid  promoter  (2), 
were  produced  by  replacing  the  URA3  marker  gene  of  pFL7  with  either 
TRP1  or  HIS3 .  pLA-jSgal-TRPl  and  pLA-0gal-HIS3,  both  of  which  ex¬ 
pressed  Leu-eAK-0gal  (Ub-Leu-eAK-0gal),  were  produced  by  replacing 
the  Ub-Arg  domain  of  pR A-0gal-TRP  1  and  pRA-/3gal-HIS3  with  Ub-Leu 
domain  of  the  pLL2  plasmid.4  The  plasmid  pFL7  was  produced  from 
pUB23-R  (2)  by  converting  the  lysine  codons  15  and  17  of  the  extension 
eK  into  arginine  codons  (36,  37),  yielding  a  construct  encoding  the 
extension  eAK  in  front  of  a  0gal  moiety  lacking  the  first  23  residues  of 
wild  type  0gal  (see  Fig.  2A).  The  low  copy,  pRS315  vector-derived  (38) 
plasmid  PR-eAKhaUra3-R3R7  expressed  Arg-eAK-ha-Ura3pK3R’K7R  (Ub- 
Arg-eAK-ha-Ura3pK3R’K7R)  from  the  PCUP1  promoter.  Arg-eAK-ha- 
Ura3pK3R,K7R  (see  Fig.  2 B)  is  called  Arg-Ura3p  in  the  main  text.  In  this 
N-end  rule  substrate,  the  residues  Lys-3  and  Lys-7  of  the  S.  cerevisiae 
Ura3p  were  converted  to  arginines  (see  “Results  and  Discussion”).  In 
addition,  the  ha  epitope  tag  (39)  was  placed  between  eAK  and 
Ura3pK3R,K7R  (see  Fig.  2 B).  The  plasmid  PR-eAKhaUra3-R3R7  was 
produced  from  pFLl  (encoding  Ub-Arg-eAK-ha-Ura3p)  through  site-di¬ 
rected  mutagenesis  of  the  XJRA3  codons  for  Lys-3  and  Lys-7.  pFLl  was 
produced  from  pKM1235  (which  encoded  Ub-Arg-eK-ha-Ura3p)5  by  con¬ 
verting  the  encoding  sequence  into  the  one  encoding  eAK. 

Pulse-Chase  and  Plating  Efficiency  Assays— Pulse-chase  assays  with 
S.  cerevisiae  in  mid-exponential  growth  (A600  of  ~1)  utilized  35S-EX- 
PRESS  (NEN  Life  Science  Products)  and  were  carried  out  as  described 
previously  (10,  19),  including  the  immunoprecipitation  with  anti-/3gal 
and  anti-ha  antibodies  and  quantitation  with  a  Phosphorlmager  (Mo¬ 
lecular  Dynamics,  Sunnyvale,  CA).  To  determine  plating  efficiency,  S. 
cerevisiae  strains  JD52  ( TJBR1 )  and  JD55  (ubrl A)  expressing  Arg- 
Ura3p  (Ub-Arg-eAK-ha-Ura3pK3R,K7R;  see  Fig.  2 B)  were  co-transformed 
with  plasmids  indicated  in  the  legend  to  Fig.  3.  The  transformants  were 
cultured  in  the  raffinose-based  medium  (SR)  lacking  Leu,  His,  and  Trp 
for  20  h.  The  cultures  were  then  diluted  into  the  otherwise  identical 
galactose-containing  (SG)  medium  to  a  final  A600  of  0.1.  At  an  A600  of 
0.4,  cultures  were  either  supplemented  with  0.1  mM  CuS04  or  left 
unsupplemented.  At  the  A600  of  1.0,  the  cultures  were  diluted  with  SG 
(which  lacks  Leu,  His,  and  Trp)  either  containing  or  lacking  0.1  mM 
CuS04  and  were  plated  on  the  plates  of  the  same  medium  composition 
that  also  either  contained  or  lacked  uracil.  The  plating  efficiency  (%) 


3  F.  Levy  and  A.  Varshavsky,  unpublished  data. 

4  M.  Ghislain  and  A.  Varshavsky,  unpublished  data. 

5  K.  Madura  and  A.  Varshavsky,  unpublished  data. 
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Fig.  1.  The  concept  of  a  bivalent  inhibitor  of  the  N-end  rule 
pathway.  A,  the  type  1  and  type  2  sites  of  S.  cerevisiae  Ubrlp  (N- 
recognin),  which  are  specific,  respectively,  for  the  basic  (Arg,  Lys,  and 
His)  and  bulky  hydrophobic  (Phe,  Leu,  Trp,  Tyr,  and  lie)  N-terminal 
residues.  In  the  diagram,  the  type  1  and  type  2  sites  are  occupied  by 
their  ligands,  the  N-terminal  Arg  and  Leu,  borne  by  a  heterodimeric 
bivalent  inhibitor  (actually,  a  tetrameric  0gal-based  protein  in  the 
present  work).  A  test  substrate  bearing  Arg,  a  type  1  destabilizing 
N-terminal  residue  is  shown  as  well.  The  test  substrate,  in  contrast  to 
the  protein-based  inhibitor,  bears  at  least  one  internal  Lys  residue  (not 
indicated  in  the  diagram)  that  can  function  as  a  component  of  the 
N-degron.  The  type  1  and  type  2  sites  of  N-recognin  are  shown  located 
close  together  in  the  N-terminal  region  of  the  225-kDa  Ubrlp.  The 
recent  genetic  dissection  of  the  Ubrlp  substrate-binding  sites6  placed 
the  type  1  and  type  2  sites  close  together  in  the  ~60-kDa  N-terminal 
region  of  the  225-kDa  Ubrlp.  B,  a  diagram  illustrating  the  expected 
frequencies  of  heterodimeric  (Arg-  and  Leu-bearing)  dimers  within  a 
0gal-based  bivalent  inhibitor.  Specifically,  at  equal  levels  of  expression 
of  the  two  0gal-based  polypeptide  chains,  50%  of  0gal  tetramers  would 
be  expected  to  be  heterotetramers  in  which  at  least  one  of  the  two 
dimers  bears  different  (Arg  and  Leu)  N-terminal  residues.  In  the  0gal 
tetramer,  the  two  N  termini  of  each  dimer  are  spatially  close,  exposed, 
and  oriented  in  the  same  direction  (40).  See  also  “Results  and 
Discussion.” 

was  defined  as  the  ratio  of  the  number  of  colonies  on  SG  (-Leu,  -His, 
-Trp,  -Ura)  plates  to  the  number  of  colonies  on  SG  (-Leu,  -His, 
-Trp)  plates,  at  the  same  concentration  of  CuS04.  For  each  measure¬ 
ment,  colonies  on  15  plates  were  counted  to  yield  the  average  number  of 
colonies  per  plate. 

RESULTS  AND  DISCUSSION 

We  constructed  a  bivalent  N-end  rule  inhibitor  (Fig.  1A)  from 
the  previously  studied  N-end  rule  substrates  derived  from  E. 
coli  0gal  (2).  In  eukaryotes,  linear  Ub-protein  fusions  are  rap¬ 
idly  cleaved  by  deubiquitylating  enzymes  at  the  Ub-protein 
junction,  making  possible  the  production  of  otherwise  identical 
proteins  bearing  different  N-terminal  residues,  a  technical  ad¬ 
vance  that  led  to  the  finding  of  the  N-end  rule  (2).  A  /3gal-based 
N-end  rule  substrate  contains  a  destabilizing  N-terminal  resi¬ 
due  (produced  in  vivo  using  the  Ub  fusion  technique  (1));  a 
— 45-residue,  E.  coli  Lac  repressor-derived  N-terminal  exten¬ 
sion  called  eK  (extension  e  bearing  lysines  K)\  and  the  /3gal 
moiety  lacking  its  first  21  residues.  The  resulting  X-eK-/3gal  is 
a  short-lived  protein  in  both  yeast  and  mammalian  cells, 
whereas  an  otherwise  identical  protein  bearing  a  stabilizing 
N-terminal  residue  such  as  Met  or  Val  is  metabolically  stable 
(1,  2).  An  N-degron  comprises  a  destabilizing  N-terminal  resi¬ 
due  and  a  Lys  residue  (or  residues),  the  latter  being  the  site  of 
formation  of  a  multi-Ub  chain  (1,  36).  (Ubrlp  can  also  recognize 
a  set  of  other,  internal  degrons,  which  remain  to  be  character¬ 
ized  (18).)  If  Lys-15  and  Lys-17  of  the  eK  extension  are  replaced 
by  the  Arg  residues  (which  cannot  be  ubiquitylated),  the  result¬ 
ing  X-eAK-/3gal  (Fig.  2A)  is  long-lived  in  vivo  even  if  its  N- 
terminal  residue  is  destabilizing  in  the  N-end  rule  (1,  37). 

In  the  present  work,  we  used  the  metabolically  stable  Arg- 
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Fig.  2.  Designs  of  bivalent  inhibitor  and  test  substrate.  A ,  the 

j3gal-based  fusions  (the  residue  X  was  either  Arg  or  Leu)  used  to 
construct  the  Arg/Leu -bearing  bivalent  inhibitor.  The  Ub  moiety  of  the 
fusions  was  cotranslationally  removed  in  vivo  by  deubiquitinating  en¬ 
zymes  (1).  The  ~45-residue,  E.  coli  Lac  repressor-derived  sequence 
termed  eAK  (extension  ( e )  lacking  lysines  (AID),  is  described  in  the  main 
text.  The  £gal  part  of  the  fusion  lacked  the  first  21  residues  of  wild  type 
j3gal  (2).  B,  the  Ura3p-based  N-end  rule  substrate,  Arg-eAK-ha- 
Ura3pK3RK7R,  derived  from  Ub-Arg-eAK-ha-Ura3pK3R  K7R  and  denoted 
Arg-Ura3p,  is  described  in  the  main  text. 


eAK-j3gal  (produced  from  Ub-Arg-e  AK-j3gal)  and  Leu-eAK-/3gal 
(produced  from  Ub-Leu-eAK-/3gal).  These  proteins  retain  the 
ability  to  bind,  respectively,  to  the  type  1  and  type  2  sites  of 
N-recognin  but  cannot  be  ubiquitylated  (37),  apparently  be¬ 
cause  the  most  N-terminal  Lys  residue  in  X-eAK-j3gal,  at  posi¬ 
tion  239,  is  too  far  from  the  N  terminus  of  the  protein.  In  the 
/3gal  tetramer,  the  two  N  termini  of  each  dimer  are  spatially 
close,  exposed,  and  oriented  in  the  same  direction  (40).  At  equal 
levels  of  expression  of  the  two  /3gal-based  polypeptide  chains 
such  as  Arg-eAK-/3gal  and  Leu-eAK-/3gal,  50%  of  0gal  tetramers 
would  be  expected  to  be  heterotetramers  in  which  at  least  one 
of  the  two  dimers  bears  different  (Arg  and  Leu)  N-terminal 
residues  (Fig.  IB).  If  the  type  1  and  type  2  substrate-binding 
sites  of  the  225 -kDa  Ubrlp  are  appropriately  located  and  ori¬ 
ented,  they  might  be  able  to  bind  the  Arg-  and  Leu-bearing 
subunits  of  the  mixed  /3gal  tetramer,  especially  in  view  of  the 
presumed  flexibility  of  the  eAK  extension  (1)  (Fig.  1A). 

The  reporter  N-end  rule  substrate  in  this  study  was  Arg-eAK- 
ha-Ura3pK3R,K7R,  denoted  below  as  Arg-Ura3p  (Fig.  2 B).  This 
ha-tagged,  type  1  N-end  rule  substrate  was  produced  from 
Ub-Arg-e  AK-ha-Ura3pK3R>K7R  through  the  cotranslational  in 
vivo  cleavage  by  deubiquitinating  enzymes  (1,  6,  41).  The  ly¬ 
sine-lacking  eAK  extension  of  Arg-eAK-ha-Ura3pK3R  K7R,  and 
the  replacement  of  the  first  two  lysines  of  the  Ura3p  moiety 
with  arginines  were  used  to  decrease  the  rate  of  degradation  of 
Arg-Ura3p  by  the  N-end  rule  pathway  and  also  to  reduce  the 
slow  but  detectable  degradation  of  Arg-Ura3p  by  yet  another 
pathway,  through  a  degron  distinct  from  the  N-degron.3  Sev¬ 
eral  Lys  residues  of  Ura3p  other  than  Lys-3  and  Lys-7  are  also 
close  to  its  N  terminus,  thus  accounting  for  the  absence,  in  this 
case,  of  the  all-or-none  effect  on  the  reporter  degradation  that 
is  observed  when  eK  is  replaced  with  eAK  in  an  X-eK-/3gal 
substrate  (37).  The  Lys-3  — »  Arg  and  Lys-7  — »  Arg  modifications 
decreased  the  enzymatic  activity  of  the  Ura3p  moiety.2  The 
reduced  enzymatic  activity  of  Ura3pK3R>K7R  facilitated  selec¬ 
tion  assays  (Figs.  3  and  4). 

The  first  bivalent  inhibitor  assay  employed  ura3  S.  cerevisiae 
expressing  Arg-Ura3p  (Fig.  2 B)  from  the  uninduced  Pcupi  Pr0‘ 
moter.  The  Ubrlp-mediated  degradation  of  Arg-Ura3p  (ty,  of 
~8  min)  and  its  correspondingly  low  steady-state  concentration 
rendered  wild  type  ( UBR1 )  cells  phenotypically  Ura-,  whereas 
ubrlA  strains  expressing  Arg-Ura3p  were  phenotypically  Ura+ 
(Figs.  3  and  4  and  data  not  shown).  Cells  expressing  Arg-Ura3p 
were  co transformed  with  two  control  plasmids  (vectors; 


SG(-Leu,-His,-Trp,-Ura)  SG(-Leu,-His,-Trp,-Ura) 
(+  Cu++)  (-aT) 


Fig.  3.  A  bivalent,  but  not  monovalent,  inhibitor  of  Ubrlp  con¬ 
fers  Ura+  phenotype  on  cells  expressing  the  short-lived  N-end 
rule  substrate  Arg-Ura3p.  S.  cerevisiae  JD52  ( UBR1 )  expressing 
Arg-Ura3p  (Ub-Arg-Ura3p),  a  short-lived  reporter  (Fig.  2B),  were  co- 
transformed,  alternatively,  with  pRS423  (7//<S5 -based  control  vector) 
and  pRS424  (PGA!jl,  TRP1- based  control  vector)  (denoted  as  423+424 ); 
with  pRS424  and  pRA-j3gal-HIS3,  expressing  Arg-eAK-j3gal  (Ub-Arg- 
eAK-/3gal)  (denoted  as  R+424;  monovalent  inhibitor);  with  pRS423  and 
pLA-/3gal-TRPl,  expressing  Leu-eAK-j3gal  (Ub-Leu-eAK-/3gal)  (denoted 
as  423  +L);  with  pRA-j3gal-HIS3  and  pRA-/3gal-TRPl,  both  expressing 
Arg-eAK-j3gal  (denoted  as  R+R;  monovalent  inhibitor);  with  pLA-/3gal- 
HIS3  and  pLA-/3gal-TRPl,  both  expressing  Leu-eAK-/3gal  (denoted  as 
L+L);  or  with  pRA-j3gal-HIS3  and  pLA-j3gal-TRPl,  expressing  Arg-eAK- 
pgal  and  Leu-eAK-/3gal  (denoted  as  R+L;  the  bivalent  inhibitor).  Cells 
were  streaked  on  SG  medium  containing  0.1  mM  CuS04  and  lacking 
Leu,  His,  Trp,  and  Ura  ( upper  left  panel),  on  the  otherwise  identical 
medium  lacking  the  added  CuS04  ( upper  right  panel),  or  on  the  Ura- 
containing  SG  medium  lacking  Leu,  His,  and  Trp  (controls;  lower  left 
panel).  Plates  were  incubated  at  30  °C  for  3  days. 


423+424  in  Fig.  3).  Alternatively,  these  cells  were  cotrans¬ 
formed  with  two  plasmids  (bearing  different  selectable  mark¬ 
ers)  that  expressed  either  Arg-eAK-j3gal  alone  ( R+R  in  Fig.  3), 
Leu-eAK-j3gal  alone  ( L+L  in  Fig.  3),  or  both  of  them  together 
(R+L  in  Fig.  3;  the  bivalent  inhibitor  mode)  from  a  galactose- 
inducible  promoter.  Pairs  of  alternatively  marked  plasmids 
were  used  to  make  certain  that  the  conditions  of  expression  and 
the  total  amounts  of  jSgal-based  proteins  produced  remained 
the  same  in  all  of  these  settings.  The  transformants  were 
streaked  on  SG  medium  lacking  uracil. 

Remarkably,  only  those  Arg-Ura3p-expressing  cells  that  ex¬ 
pressed  both  Arg-eAK-/3gal  and  Leu-eAK-j3gal  became  Ura+  un¬ 
der  these  conditions  (Fig.  3).  The  cells  that  expressed  either 
Arg-eAK-j3gal  alone  or  Leu-eAK-j3gal  alone  remained  Ura-,  as 
did  the  cells  that  received  control  plasmids  (Fig.  3).  (The  same 
cells  grew  equally  well  in  the  control  SG  medium  containing 
uracil  (Fig.  3,  bottom  left  panel).)  Note  that  the  monovalent 
inhibitors  were  ineffective  despite  the  fact  that  the  concentra¬ 
tion  of  either  the  Arg-based  N  terminus  alone  or  the  Leu-based 
N  terminus  alone  was  twice  the  concentration  of  the  same  N 
termini  in  the  case  of  the  bivalent  inhibitor. 

To  quantify  the  effect  of  coexpressing  Arg-eAK-/3gal  and  Leu- 
eAK-/3gal  on  the  rescue  of  the  Ura+  phenotype,  a  plating  effi¬ 
ciency  assay  was  carried  out  with  the  same  transformants. 
Equal  amounts  of  cells  were  plated  on  SG(+Ura)  and 
SG(— Ura)  plates,  and  the  numbers  of  colonies  were  deter¬ 
mined.  When  the  Arg-Ura3p  reporter  was  expressed  at  a  suf¬ 
ficiently  low  rate  (uninduced  PCupi  promoter),  cells  became 
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Fig.  4.  Plating  efficiencies  of  S. .  cerevisiae  expressing  Arg- 
Ura3p  in  the  presence  of  bivalent  and  monovalent  inhibitors  of 
the  N-end  rule  pathway.  A,  S.  cerevisiae  JD52  ( UBR1 )  and  JD55 
(i ubrlA )  expressing  Arg-Ura3p  were  cotransformed  with  the  sets  of 
plasmids  described  and  denoted  in  the  legend  to  Fig.  3.  The  transfor¬ 
mants  were  cultured  as  described  under  “Experimental  Procedures” 
and  plated  on  either  SG(-Leu,  -His,  -Trp,  -Ura)  plates  or  control 
plates  SG(-Leu,  -His,  -Trp)  containing  0.1  mM  CuS04.  The  plating 
efficiencies  shown  are  the  values  produced  by  normalization  against  the 
absolute  plating  efficiency  (92%)  of  the  positive  control:  the  ubrlA 
strain  JD55  expressing  Arg-Ura3p  and  bearing  the  vector  pRS424.  B, 
the  same  experiment  was  done  using  plates  lacking  the  added  CuS04. 
The  plating  efficiencies  shown  are  the  values  produced  by  normaliza¬ 
tion  against  the  positive  control  used  in  A.  Under  these  growth  condi¬ 
tions  (no  added  CuS04),  the  absolute  plating  efficiency  of  the  positive 
control  was  26%. 

Ura+  (through  metabolic  stabilization  of  Arg-Ura3p)  only  in 
the  presence  of  both  Arg-eAK-/3gal  and  Leu-eAK-/3gal  (Fig.  4B).  A 
weak  stabilizing  effect  of  Arg-eAK-j3gal  alone  could  be  detected 
only  at  a  -20-fold  higher  level  of  Arg-Ura3p  expression  (in¬ 
duced  PCUP1  promoter)  (Fig.  4A).  No  stabilization  of  Arg-Ura3p 
was  observed  in  the  presence  of  Leu-eAK-/3gal  under  any  con¬ 
ditions  (Fig.  4),  confirming  the  specificity  of  inhibition  in  regard 
to  the  type  (basic  or  bulky  hydrophobic)  of  the  primary  desta¬ 
bilizing  N-terminal  residue  of  the  reporter.  Higher  sensitivity 
of  this  assay  at  the  higher  level  of  Arg-Ura3p  expression  results 
from  a  higher  steady-state  level  of  the  short-lived  Arg-Ura3p, 
so  that  even  its  marginal  stabilization  suffices  to  render  a  small 
fraction  of  cells  Ura+  (Fig.  4A;  compare  with  Fig.  4B). 

To  analyze  directly  the  in  vivo  degradation  of  Arg-Ura3p  in 
the  presence  of  different  combinations  of  X-eAK-j3gal  proteins, 
the  transformants  of  Figs.  3  and  4  were  subjected  to  pulse- 
chase  analysis,  with  immunoprecipitation  of  both  Arg-Ura3p 
and  the  (long-lived)  X-eAK-j3gals  (Fig.  5).  Quantitation  of  the 
resulting  electrophoretic  patterns  (Fig.  5 C)  confirmed  and  ex¬ 
tended  the  conclusions  reached  through  phenotypic  analyses 
(Figs.  3  and  4).  Specifically,  the  normally  short-lived  Arg- 
Ura3p  (Fig.  5A,  lanes  1-3 )  was  strongly  (but  still  incompletely) 
stabilized  in  the  presence  of  both  Arg-eAK-/3gal  and  Leu-eAK- 
/3gal  (Fig.  5A,  lanes  4-6;  compare  with  lanes  1-3  and  7-9).  This 
stabilization  was  manifested  especially  clearly  as  an  increase 
in  the  relative  amount  of  Arg-Ura3p  at  the  beginning  of  chase 
(time  0),  indicating  reduced  degradation  of  Arg-Ura3p  during 
the  pulse  (Fig.  5C).  This  latter  degradation  pattern,  termed 
“zero  point  effect,”  is  caused  by  the  previously  demonstrated 
preferential  targeting  of  newly  formed  (as  distinguished  from 
conformationally  mature)  protein  substrates  by  the  N-end  rule 
pathway  (16,  42).  The  increased  steady-state  level  of  Arg- 
Ura3p  in  the  presence  of  both  Arg-eAK-/3gal  and  Leu-eAK-/3gal 
accounted  for  the  results  of  phenotypic  analyses  (Figs.  3  and  4). 
The  much  smaller  but  detectable  stabilization  of  Arg-Ura3p  by 
Arg-eAK-j3gal  alone  (Fig.  50  was  consistent  not  only  with  the 
inability  of  Arg-eAK-j3gal  to  confer  the  Ura+  phenotype  on  cells 
expressing  Arg-Ura3p  from  uninduced  PCupi  promoter  but  also 
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Fig.  5.  Metabolic  stabilization  of  Arg-Ura3p  in  the  presence  of 
bivalent  N-end  rule  inhibitor.  A,  S.  cerevisiae  JD52  ( UBR1 )  express¬ 
ing  Arg-Ura3p,  a  short-lived  Ura3p-based  reporter  (Fig.  2B),  from  the 
induced  PCUP1  promoter,  were  cotransformed,  alternatively,  with  either 
pRS423  (H/S3-based  control  vector)  and  pRS424  (PGAL1,  TRP  1 -based 
control  vector)  (denoted  as  423+424 ),  or  with  pRA-/3gal-TRPl  and  pLA- 
/3gal-HIS3,  expressing  Arg-eAK-/3gal  and  Leu-eAK-j3gal  (denoted  as  R+L; 
the  bivalent  inhibitor).  Control  JD55  (ubrlA)  cells  expressing  Arg- 
Ura3p  were  transformed  with  pRS424.  Cells  grown  in  either  dextrose- 
containing  SD  medium  (no  expression  of  j3gal)  or  galactose-containing 
SG  medium  were  labeled  with  [35S] methionine/cysteine  for  5  min  at 
30  °C,  followed  by  a  chase  for  0,  10,  and  30  min,  extraction,  immuno¬ 
precipitation,  and  SDS-10%  polyacrylamide  gel  electrophoresis.  B, 
same  as  in  A,  but  cells  were  also  cotransformed  with  the  plasmids 
pRA-j3gal-TRPl  and  pRA-j3gal-HIS3,  both  expressing  Arg-eAK-/3gal  (de¬ 
noted  as  R+R).  The  assays  were  carried  out  in  SG  medium.  C,  in  vivo 
decay  curves  of  Arg-Ura3p  (Fig.  2 B)  in  wild  type  (JD52)  and  ubrlA 
(JD55)  cells.  The  patterns  in  A  and  B  were  quantified  as  described 
under  “Experimental  Procedures.”  The  initial  amounts  of  Arg-Ura3p 
were  normalized  against  the  amount  in  ubrlA  cells  (100%).  Note  that 
the  inhibitors  also  altered  the  zero  point  effect  (degradation  of  a  re¬ 
porter  during  the  pulse  (16,  42)).  •,  degradation  of  Arg-Ura3p  in  UBR1 
(JD52)  cells  in  the  presence  of  both  Arg-eAK-j3gal  and  Leu-eAK-/3gal  (cells 
were  grown  in  SG  medium);  O,  the  same  transformants  were  grown  in 
SD  medium  where  j3gal  fusions  were  not  expressed;  ▼,  JD52  cells 
expressing  Arg-Ura3p  were  transformed  with  the  two  alternatively 
marked  plasmids  expressing  Arg-eAK-/3gal  and  grown  in  SG  medium;  □, 
JD52  cells  expressing  Arg-Ura3p  were  transformed  with  the  two  alter¬ 
natively  marked  control  vectors  and  grown  in  SG  medium;  ■,  ubrlA 
(JD55)  cells  expressing  Arg-Ura3p  were  transformed  with  control  vec¬ 
tors  and  grown  in  SG  medium. 

with  the  partial  rescue  of  the  Ura+  phenotype  by  Arg-eAK-/3gal 
in  cells  expressing  Arg-Ura3p  from  the  induced  Pcupi  (Figs*  3 
and  4  and  data  not  shown). 
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*  The  Arg/Leu-eAK-/3gal-based  bivalent  inhibitor  of  the  present 
work,  although  surprisingly  potent  (Fig.  4£),  is  obviously  far 
from  optimal  even  for  a  protein-based  inhibitor;  because  /3gal  is 
a  homotetramer,  only  -50%  of  the  coexpressed  Arg-eAK-/3gal 
and  Leu-eAK-/3gal  chains  would  exist  as  heterodimers  within 
tetramers  (Fig.  IB).  (This  estimate  assumes  a  random  assort¬ 
ment  of  Arg-  and  Leu-bearing  /3gal  chains  in  the  formation  of 
/3gal  tetramers.  The  actual  in  vivo  assortment  is  expected  to  be 
biased,  to  an  unknown  extent,  in  favor  of  homodimeric  associ¬ 
ations,  because  individual  polysomes  would  produce  J3gal 
chains  bearing  either  Arg  or  Leu  but  not  both.)  In  addition, 
although  the  eAK  extension  (Fig.  2 A)  is  capable  of  supporting 
the  desired  effects,  it  is  also  unlikely  to  be  optimal.  In  sum¬ 
mary,  the  efficacy  of  this  first  and  necessarily  suboptimal  bi¬ 
valent  inhibitor  bodes  well  for  the  future  of  this  design. 

A  bivalent  inhibitor  is  strikingly  more  efficacious  than  an 
otherwise  identical  monovalent  inhibitor  (Figs.  3-5).  In  addi¬ 
tion,  our  findings  are  the  first  evidence  that  the  type  1  and  type 
2  sites  of  N-recognin  are  spatially  proximal  in  the  225-kDa  S. 
cerevisiae  Ubrlp.  While  this  work  was  under  way,  genetic 
dissection  of  S.  cerevisiae  Ubrlp  identified  amino  acid  residues 
that  are  required  for  the  integrity  of  the  type  1  site  but  not  the 
type  2  site,  and  vice  versa.6  These  results  provided  independent 
evidence  for  both  the  separateness  and  spatial  proximity  of  the 
two  substrate-binding  sites  of  the  225-kDa  N-recognin,  in 
agreement  with  the  present  data.  Our  results  (Figs.  3-5) 
strongly  suggest  that  small  bivalent  inhibitors  of  the  N-end 
rule  pathway  are  feasible,  and  moreover,  are  expected  to  be 
much  more  potent  than  their  monovalent  counterparts.  Work 
to  produce  such  inhibitors  is  under  way. 
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The  split-ubiquitin  technique  was  used  to  detect  transient  protein  interactions  in  living 
cells.  Nub,  the  N-terminal  half  of  ubiquitin  (Ub),  was  fused  to  Sec62p,  a  component  of  the 
protein  translocation  machinery  in  the  endoplasmic  reticulum  of  Saccharomyces  cerevisiae. 
Cub,  the  C-terminal  half  of  Ub,  was  fused  to  the  C  terminus  of  a  signal  sequence.  The 
reconstitution  of  a  quasi-native  Ub  structure  from  the  two  halves  of  Ub,  and  the  resulting 
cleavage  by  Ub-specific  proteases  at  the  C  terminus  of  Cub,  serve  as  a  gauge  of  proximity 
between  the  two  test  proteins  linked  to  Nub  and  Cub.  Using  this  assay,  we  show  that 
Sec62p  is  spatially  close  to  the  signal  sequence  of  the  prepro-a-factor  in  vivo.  This 
proximity  is  confined  to  the  nascent  polypeptide  chain  immediately  following  the  signal 
sequence.  In  addition,  the  extent  of  proximity  depends  on  the  nature  of  the  signal 
sequence.  Cub  fusions  that  bore  the  signal  sequence  of  invertase  resulted  in  a  much  lower 
Ub  reconstitution  with  Nub-Sec62p  than  otherwise  identical  test  proteins  bearing  the 
signal  sequence  of  prepro-a-factor.  An  inactive  derivative  of  Sec62p  failed  to  interact 
with  signal  sequences  in  this  assay.  These  in  vivo  findings  are  consistent  with  Sec62p 
being  part  of  a  signal  sequence-binding  complex. 


INTRODUCTION 

A  critical  step  during  the  translocation  of  a  protein 
across  the  membrane  of  the  endoplasmic  reticulum 
(ER)  is  the  interaction  between  the  signal  sequence  of 
a  nascent  polypeptide  and  its  receptors  (Walter  et  al, 
1981;  Gilmore  and  Blobel,  1985;  Walter  and  Johnson, 
1994).  A  stretch  of  8  to  12  hydrophobic  residues,  often 
at  the  N  terminus  of  a  protein,  comprises  a  signal 
sequence  that  is  sufficient  to  initiate  the  protein's 
translocation  into  the  endoplasmic  reticulum  (ER) 
(Rapoport  et  al,  1996).  To  be  compatible  with  a  high 
flux  of  polypeptides  through  a  limited  number  of 
translocation  channels  in  the  ER  membrane,  the  inter¬ 
action  between  the  signal  sequence  and  its  receptors 
has  to  be  short  lived.  Its  transient  nature  makes  such  a 
receptor-ligand  interaction  difficult  to  study,  espe¬ 
cially  in  living  cells.  The  approaches  used  for  the 
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analysis  of  protein  translocation  in  cell-free  systems 
circumvent  the  transience  of  the  signal  sequence-re¬ 
ceptor  interaction  by  pausing  or  stopping  the  synthe¬ 
sis  of  a  nascent  polypeptide  chain  at  different  stages  of 
its  movement  to  and  across  the  ER  membrane  (Krieg  et 
al. ,  1986;  Kurzchalia  et  al. ,  1986;  Connolly  et  al.,  1989). 
Given  these  constraints,  it  is  essential  to  verify  in  vivo 
the  models  derived  from  in  vitro  studies.  The  ability  to 
analyze  early  translocation  events  in  vivo  should  also 
be  important  for  defining  the  immediate  environment 
of  the  nascent  chain  on  its  path  from  the  ribosome  to 
the  ER  membrane. 

Most  of  the  current  methods  for  detecting  protein  in¬ 
teractions  in  vivo  either  do  not  operate  at  the  ER  mem¬ 
brane  or  are  unable  to  detect  a  transient  proximity  be¬ 
tween  proteins  (Fields  and  Song,  1989;  Aronheim  et  al, 
1997;  Rossi  et  al,  1997;  Miyawaki  et  al,  1997).  In  the 
present  work,  we  show  that  the  previously  developed 
split-ubiquitin  (split-Ub)  technique,  also  called  USPS 
(Ub/split/protein/sensor)  (Johnsson  and  Varshavsky, 
1994a,  1997),  is  capable  of  detecting  a  transient  in  vivo 
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Figure  1.  The  split-Ub  technique  and  its  application  to  the  analysis 
of  protein  translocation.  (A)  Nub  and  Cub  are  linked  to  the  interact¬ 
ing  proteins  X  and  Y.  The  X-Y  complex  brings  Nub  and  Cub  into 
close  proximity.  Nub  and  Cub  reconstitute  a  quasi-native  Ub  moiety, 
which  is  cleaved  by  the  UBPs,  yielding  the  free  reporter  R  (Johnsson 
and  Varshavsky,  1994a).  (B)  Using  split-Ub  to  monitor  the  proximity 
between  a  secretory  protein  and  a  component  of  the  translocation 
machinery.  A  signal  sequence-bearing  Cub  fusion  (SS-Y-Cub-R)  and 
a  Nub  fusion  (Nub-X)  are  coexpressed  in  a  cell.  Pathway  1:  when  Nub 
is  linked  to  a  protein  not  involved  in  the  targeting  for  translocation, 
the  uncleaved  (except  for  the  signal  sequence  SS)  Y-Cub-R  enters  the 
ER.  Pathway  2:  when  Nub  is  linked  to  a  protein  involved  in  the 
targeting  for  translocation  the  signal  sequence  of  the  SS-Y-Cub-R 
brings  Nub  and  Cub  into  close  contact.  As  a  result,  some  of  the 
SS-Y-Cub-R  and  Nub-X  molecules  interact  to  form  the  quasi-native 
Ub,  yielding  the  free  reporter  R  in  the  cytosol. 

interaction  between  polypeptides.  The  split-Ub  method 
is  based  on  the  ability  of  Nub  and  Cub,  the  N-  and 
C-terminal  halves  of  Ub,  to  assemble  into  a  quasi-native 
Ub.  Ub-specific  proteases  (UBPs),  which  are  present  in 
all  eukaryotic  cells,  recognize  the  reconstituted  Ub,  but 
not  its  halves,  and  cleave  the  Ub  moiety  off  a  reporter 
protein  that  had  been  linked  to  the  C  terminus  of  Cub. 
The  liberation  of  the  reporter  serves  as  a  readout  indi¬ 
cating  the  reconstitution  of  Ub.  The  assay  is  designed  in 
a  way  that  prevents  efficient  association  of  Nub  and  Cub 
by  themselves,  but  allows  it  if  the  two  Ub  halves  are 
separately  linked  to  proteins  that  interact  in  vivo  (Figure 
1A).  The  split-Ub  assay  has  been  shown  to  detect  the  in 
vivo  dimerization  of  a  leucine  zipper-containing  domain 
of  the  Gcn4p  transcriptional  activator,  and  the  in  vivo 
interaction  between  two  subunits  of  the  oligosaccharyl- 
transferase  complex  (Johnsson  and  Varshavsky,  1994a; 
Stagljar  et  al ,  1998). 

In  the  present  work,  we  focus  on  the  interaction 
between  Sec62p  of  the  yeast  Saccharomyces  cerevisiae 


and  proteins  bearing  two  different  signal  sequences. 
Extensive  evidence  indicates  that  Sec62p  is  a  compo¬ 
nent  of  the  ER  translocation  machinery  (Deshaies  and 
Schekman,  1989;  Rothblatt  et  ah,  1989;  Musch  et  al, 
1992).  Sec62p  is  a  part  of  the  tetrameric  Sec62/63 
complex  that  also  contains  Sec71p  and  Sec72p  (De¬ 
shaies  et  al,  1991;  Feldheim  and  Schekman,  1994). 
Sec62/63p  can  be  isolated  as  a  tetramer,  or  as  a  part 
of  a  larger  assembly,  the  heptameric  Sec  complex 
(Panzner  et  ah,  1995).  In  addition  to  the  Sec62/63 
complex,  the  heptamer  contains  the  trimer  of  Sec61p. 
This  trimer  (Sec61p,  Ssslp,  Sbhlp  in  yeast;  Sec61a, 
Sec61j3,  Sec61y  in  mammals)  forms  the  aqueous  chan¬ 
nel  through  which  a  polypeptide  chain  is  translocated 
across  the  ER  membrane  (Simon  and  Blobel,  1991; 
Gorlich  et  ah,  1992;  Crowley  et  ah,  1993,  1994;  Mothes 
et  al,  1994;  Hanein  et  al,  1996;  Beckmann  et  al,  1997). 

The  role  of  the  Sec62/63  tetramer  is  less  well  de¬ 
fined.  Cross-linking  and  reconstitution  experiments  in 
vitro  have  shown  that  Sec62p  is  close  to  the  nascent 
polypeptide  chain  before  the  initiation  of  its  translo¬ 
cation  (Musch  et  al,  1992;  Lyman  and  Schekman,  1997; 
Matlack  et  al,  1997).  One  important  role  of  Sec63p  is  its 
ability  to  recruit  the  Hsp70-type  protein  Kar2p  of  the 
ER  lumen  to  the  vicinity  of  a  translocating  polypeptide 
(Brodsky  and  Schekman,  1993;  Lyman  and  Schekman, 
1997).  The  Sec62/63  complex  is  essential  for  the  post- 
translational  translocation  of  proteins  in  reconstituted 
vesicle  preparations  (Panzner  et  al,  1995).  Genetic 
analysis  supports  this  conclusion,  by  showing  that  the 
tetrameric  Sec62/63  complex  is  involved  in  the  trans¬ 
location  of  proteins  whose  targeting  to  the  ER  mem¬ 
brane  is  not  abolished  by  the  loss  of  the  signal  recog¬ 
nition  particle  (SRP)  (Ng  et  al,  1996).  However,  it  is 
less  clear  whether  the  Sec62/63  complex  is  the  recep¬ 
tor  for  the  signal  sequences  of  those  proteins. 

In  the  present  work,  we  demonstrate  the  ability  of 
the  split-Ub  assay  to  detect  transient  protein  interac¬ 
tions  in  living  cells.  We  show  that  the  assay  can  mon¬ 
itor  a  close  proximity  between  Sec62p  and  a  segment 
of  the  nascent  chain  of  a  signal  sequence-bearing  pro¬ 
tein.  The  apparent  extent  of  this  proximity  is  influ¬ 
enced  by  the  nature  of  the  signal  sequence  and  the 
position  of  Cub  in  the  nascent  polypeptide.  Our  anal¬ 
ysis  yields  a  crude  map  of  the  environment  of  the 
nascent  chain  during  its  targeting  to  and  translocation 
across  the  ER  membrane.  Taken  together,  these  find¬ 
ings  are  the  first  in  vivo  evidence  that  Sec62p,  a  com¬ 
ponent  of  the  translocation  machinery  in  the  endo¬ 
plasmic  reticulum,  is  a  part  of  a  signal  sequence¬ 
binding  complex. 

MATERIALS  AND  METHODS 
Construction  of  Test  Proteins 

The  Cub  fusions  8-13  (Figure  2)  were  derived  from  the  construct  I  of 
Johnsson  and  Varshavsky  (1994a),  which  encoded  Ub-DHFR-ha  and 
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Figure  2.  Nub  and  Cub  test  fusions.  (A)  Nub  (residues  1-36  of  Ub)  was  fused  to  the  N  terminus  of  either  a  transmembrane  protein  (constructs 
1-6)  or  a  cytosolic  protein  (construct  7).  The  N  termini  of  all  proteins  are  located  in  the  cytosol.  The  orientations  and  the  numbers  of 
membrane-spanning  regions  (shaded  boxes)  were  derived  from  the  published  studies  of  these  proteins,  except  for  Stel4p,  for  which  the  exact 
number  of  the  domains  and  the  localization  of  the  C  terminus  are  not  yet  known.  The  Nub  fusions  1-5  are  located  in  the  membrane  of  the 
ER;  the  N  lb  fusion  6  resides  in  the  membrane  of  the  early  Golgi.  The  Nub  fusion  2  is  a  Sec62p  derivative  lacking  the  C-terminal  60  residues. 
The  N  b  fusion  7  contains  the  full-length  triosephosphate  isomerase  (Nub-Tpilp).  (B)  Cub  fusions.  The  Cub  fusions  8  and  9  contain  the  signal 
sequence  of  the  prep ro-a-fac tor  (shaded  boxes),  followed  by  either  37  (construct  8)  or  65  residues  (construct  9)  of  the  mature  a-factor  sequence 
(striped  boxes)  and  a  7-residue  linker  sequence  (not  shown).  Cub  fusions  10-13  contain  the  signal  sequence  of  the  Suc2p  invertase  (dark  boxes) 
followed  by  23  (construct  10),  33  (construct  11),  59  (construct  12),  or  518  residues  (construct  13)  of  the  mature  sequence  of  invertase  (open 
boxes)  and  a  7-residue  linker  sequence  (not  shown).  The  Cub  fusion  14  contains  the  complete  sequence  of  S.  cerevisiae  triosephosphate 
isomerase  (Tpilp)  followed  by  a  17-residue  linker  peptide  and  Cub.  The  Cub  fusions  15  and  16  are  the  signal  sequence-lacking  counterparts 
of  the  fusions  10  and  12.  Cub  is  always  followed  by  a  reporter  protein.  The  reporter  is  DHFR-ha  or  Ura3p  for  the  Cub  fusions  8-13,  and 
DHFR-ha  for  the  Cub  fusions  14,  15,  and  16. 


contained  a  BamHl  site  at  the  amino  acid  position  36  of  Ub,  and 
from  the  previously  described  Ub  translocation  constructs  I,  VI,  IX, 
X,  XXIII,  and  XXV  (Johnsson  and  Varshavsky,  1994b).  The  above 
BamHl  site  of  the  Ub -DHFR-ha  construct  I  was  fused  to  a  linker 
sequence  in  which  a  5' -Sail  site  allowed  the  in-frame  insertion  of  an 
Eagl~Sall  fragment  containing  the  promoter,  the  signal  sequence, 
and  a  portion  of  the  mature  sequence  of  the  corresponding  Ub 
fusions.  The  newly  introduced  sequence  was  G  TCG  ACC  ATG 
TCG  GGG  GGG  ATC  CCT.  The  last  three  triplets  encode  residues 
35,  36,  and  37  of  Ub  (the  beginning  of  Cub).  The  underlined  se¬ 
quences  are  the  Sail  and  BamHl  sites,  respectively.  The  final  con¬ 
structs  were  in  the  single-copy  plasmids  pRS314  or  pRS315  (Sikorski 
and  Hieter,  1989).  Expression  of  the  Cub  fusions  bearing  Dha 
(DHFR-ha)  as  a  reporter  was  mediated  by  the  PAdhi  promoter, 
except  for  the  Cub  fusion  14,  which  was  expressed  from  the  PCUP1 
promoter.  The  same  promoter  was  used  for  expressing  the  Ura3p- 
based  Cub  fusions. 

The  Cub  fusions  15  and  16  (Figure  2)  were  derived  from  constructs 
10  and  12  by  deleting  the  HmdIII  fragment  spanning  the  first  four 
codons  of  the  SUC2  ORF  and  a  short  portion  of  the  polylinker 
sequence  between  the  3'-end  of  the  PAdhi  promoter  and  the  SUC2 
ORF.  As  a  result,  the  translation  of  Cub  fusions  15  and  16  began  at 
the  first  codon  of  the  mature  invertase,  skipping  its  signal  sequence. 
The  Cub  fusion  14  (Figure  2)  was  produced  through  an  in-frame 
fusion  of  a  PCR  fragment  containing  the  complete  TPI1  coding 


sequence  and  Cub-Dha.  The  sequence  between  TP11  and  Cub  is  as 
follows:  AAC  GGG  TCG  ACC  GAC  TAC  AAG  GAC  GAC  GAT 
GAC  AAG  GGC  TCG  ACC  ATGTCG  GGG  GGG  ATC  CCT.  The 
underlined  sequences  indicate,  respectively,  the  last  codon  of  TP II 
and  the  first  three  codons  of  Cub. 

A  fragment  encoding  Nub-Sec62p  was  constructed  using  PCR 
amplification  of  a  1050  base  pair  (bp)  fragment  containing  the  SEC62 
ORF.  PCR  introduced  a  BamHl  site  and  a  linker  sequence  in  front  of 
the  start  codon  of  SEC62  and  an  Xhol  site  173  bp  downstream  of  the 
stop  codon.  The  PCUPrNllb  modules  were  cloned  as  BamHl  frag¬ 
ments  in  frame  with  the  SEC62  ORF.  The  sequence  between  Nub  and 
SEC62  is  GGG  ATC  CCT  TCT  GGG  ATG.  The  first  three  codons 
encode  residues  35, 36,  and  37  of  Nub,  followed  by  the  Gly-Ser  linker 
and  the  start  codon  of  SEC62 .  The  BamHl  site  is  underlined.  The 
final  constructs  resided  in  pRS316  or  pRS313.  Nub~TPIl,  Nub-SED5, 
N Ub-STE14  (a  gift  from  N.  Lewke),  and  Nub-Sec62(AC60)-Dha  were 
constructed  similarly  to  Nub-SEC62.  With  the  exception  of  Nllb- 
Sec62(AC60)-Dha,  which  was  placed  in  pRS316  and  pRS313,  all  of 
these  fusions  resided  in  pRS314.  The  linker  connecting  codon  35  of 
Nub  and  the  first  codon  of  a  linked  gene  was  GGG  ATC  CCT  GGG 
GAT  ATG  for  Nub-TPn  and  Nub-SED5,  and  GGG  ATC  CCT  GGG 
GAT  CAC  for  N ub-STE14.  Underlined  are  the  BamHl  site  and  the 
first  codon  of  the  linked  gene.  The  sequence  GGG  TCG  ACC  TTA 
ATG  CAG  AGA  TCT  GGC  ATC  ATG  GTT  connected  the  last  codon 
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Table  1.  Yeast  strains 

Strain 

Relevant  genotype 

Source  /  comment 

YPH500 

MAT  a  ade2-101  his3-A200  Ieu2-Al  lys2-801  trpl-A63  ura3-52 

Sikorski  and  Hieter  (1989) 

JD53 

MAT  a  his3-A200  leu2-3,112  lys2-801  trpl~A63  ura3-52 

Dohmen  et  al  (1995) 

RSY529 

MATa  his4  leu2-3,112  ura3-52  sec62-l 

Deshaies  and  Schekman  (1989) 

NJY51 

MATa  ade2-101  his3-A200  Ieu2-Al  lys2-801  trpl~A63  ura3-52 
NUB-BOSl::pRS306 

Derivative  of  YPH500 

NJY62-1 

MATa  his3-A200  leu2-3,112  lys2-801  trpl-A63  ura3~52 
NUB-SEC62::pRS306 

Derivative  of  JD53 

NJY73-I 

MATa  his3-A200  leu2-3,112  lys2-801  trpl-A63  ura3~52 
NUB-BOSl::pRS303 

Derivative  of  JD53 

NJY74-I 

MATa  hi$3-A200  leu2-3,112  lys2-801  trpl~A63  ura3-52 
NUB-BOSl::pRS306 

Derivative  of  JD53 

NJY61-I 

MATa  his3-A200  leu2~3,112  lys2-80l  trpHA63  ura3-52 
NUB-SEC61::pRS304 

Derivative  of  JD53 

NJY61-A 

MATa  his3~A200  leu2-3,112  lys2-801  trpl-A63  ura3-52 

NUA-SEC61  ::pRS304 

Derivative  of  JD53 

NJY61-G 

MATa  his3-A200  leu2-3,112  lys2-801  trpl-A63  ura3-52 
NUG-SEC61::pRS304 

Derivative  of  JD53 

of  SEC62  in  Nub-Sec62(AC60)-Dha  (codon  223,  underlined)  to  the 
first  two  codons  of  DHFR  (underlined). 

Nuh-BOSl  was  constructed  in  part  by  PCR  amplification,  with 
two  synthetic  oligos  and  yeast  genomic  DNA  as  a  template,  yielding 
a  258-bp  fragment  containing  the  first  229  bp  of  the  BOS1  ORF. 
Upstream  of  the  BOS1  ATG  was  a  short  linker  sequence  and  a 
BamHl  site,  to  allow  in-frame  fusion  of  the  Pcupi  promoter-Nub 
module.  The  sequence  between  Nub  and  BOS1  reads:  GGG  ATC 
CCT  CCA  GGA  ATG.  The  first  four  triplets  encode  residues  35,  36, 
37,  and  38  of  Nub,  followed  by  the  Gly  codon  and  the  start  codon  of 
BOS1.  The  BamHl  site  is  underlined.  The  3'-region  of  the  resulting 
fragment  terminated  in  a  Sail  site  for  insertion  into  the  integrating 
vectors  pRS306  or  pRS303.  The  vector  was  cut  at  the  unique  EcoBl 
site  in  the  BOSl-containing  fragment  and  transformed  into  S.  cer- 
evisiae  strains  YPH500  and  JD53  to  produce,  through  homologous 
recombination,  the  integrated  cassette  that  expressed  Nub-Boslp 
from  the  PCUP1  promoter.  The  presence  of  the  desired  gene  fusion 
and  the  absence  of  wild-type  BOS1  were  verified  by  PCR. 

An  integrated  copy  of  PCUP1-Nub-S£C62  was  produced  by  ampli¬ 
fying  the  first  438  bp  of  the  SEC62  ORF,  and  then  cloning  it,  using 
the  BamHl  and  EcoRI  restriction  sites,  in  frame  behind  the  pRS306- 
PCuprNub  cassette.  A  unique  Aflll  site  in  the  SEC62  ORF  was  used 
to  linearize  the  plasmid  for  transformation  and  integration  at  the  S. 
cerevisiae  SEC62  gene,  yielding  the  strain  NJY62-I.  The  N-terminal 
147-residue  fragment  of  Sec62p  that  was  coexpressed  with  Nub- 
Sec62p  in  the  resulting  strain  has  previously  been  shown  to  be 
inactive  in  translocation  (Deshaies  and  Schekman,  1990).  N uh-SEC61 
was  constructed  by  targeted  integration  of  a  Nuh-SEC61- containing 
fragment  into  SEC61  of  the  S.  cerevisiae  strain  JD53  (Table  1).  Spe¬ 
cifically,  a  fragment  containing  the  first  875  bp  of  the  SEC61  ORF 
was  amplified  by  PCR  and  inserted  downstream  of  the  pRS304-  or 
pRS303-based  Pcupi"Nub  cassette,  using  the  flanking  BamHl  and 
£coRI  sites.  The  linker  sequence  between  Nub  and  SEC61  was  GGG 
ATC  CCT  GGG  TCT  GGG  ATG.  Underlined  are  the  BamHl  site  and 
the  first  codon  of  SEC61.  For  targeted  integration,  the  plasmid  was 
linearized  at  the  unique  Stul  site  in  the  SEC61  ORF  to  create  the 
yeast  NJY61-I.  A  detailed  description  of  the  NJY61  strains  (Table  1) 
will  be  presented  elsewhere  (Wittke  and  Johnsson,  unpublished 
data). 

All  of  the  PCupi  promoter-controlled  ORFs  were  expressed  under 
noninducing  conditions  (no  copper  added  to  the  medium),  except  in 
the  experiment  shown  in  Figure  5B,  where  cells  were  incubated  in 
the  presence  of  0.1  mM  CuS04. 


Immunoblotting 

Proteins  fractionated  by  SDS-12.5%  PAGE  were  electroblotted  onto 
nitrocellulose  (Schleicher  &  Schuell,  Dassel,  Germany)  or  polyvi- 
nylidene  difluoride  (Machery-Nagel,  Diiren,  Germany)  membranes, 
using  the  semidry  transfer  system  (Hoeffer  Pharmacia  Biotech,  San 
Francisco,  CA).  Blots  were  incubated  with  an  anti-ha  monoclonal 
antibody  (Babco,  Richmond,  CA)  and  visualized  using  horseradish 
peroxidase-coupled  goat  anti-mouse  antibody  (Bio-Rad,  Hercules, 
CA),  the  chemiluminescence  detection  system  (Boehringer,  Mann¬ 
heim,  Germany),  and  x-ray  films.  Where  indicated,  quantification 
was  performed  using  the  Lumi  Imager  system  (Boehringer). 

Pulse-Chase  Analysis 

Yeast-rich  (YPD)  and  synthetic  minimal  media  with  2%  dextrose 
(SD)  were  prepared  as  described  previously  (Dohmen  et  al,  1995).  S. 
cerevisiae  cells  expressing  the  Nub  and  Cub  fusions  were  grown  at 
30°C  in  10  ml  of  SD  medium  without  externally  added  copper  to  an 
OD600  of  ~1  and  labeled  for  5  min  with  Redivue  Promix-[35S] 
(Amersham,  Buckinghamshire,  United  Kingdom),  followed  (either 
directly  or  after  a  chase)  by  immunoprecipitation  with  the  anti-ha 
monoclonal  antibody,  essentially  as  described  by  Johnsson  and 
Varshavsky  (1994a, b).  The  EndoH  analysis  of  glycosylated  proteins 
was  carried  out  as  described  by  Or  lean  et  al.  (1991).  Samples  were 
concentrated  before  SDS-12.5%  PAGE  by  precipitation  with  chloro¬ 
form/methanol.  Gels  were  fixed  and  enhanced  for  fluorography. 
For  quantitative  analysis,  a  dried  gel  was  exposed  and  scanned 
using  a  Phosphorlmager  (Molecular  Dynamics,  Sunnyvale,  CA). 


RESULTS 

Experimental  Strategy 

The  use  of  split  Ub  to  monitor  the  proximity  between 
the  proteins  X  and  Y  requires  the  construction  of  two 
"complementary"  fusion  proteins.  One  fusion  bears 
Nub  (see  INTRODUCTION)  linked  to  X  (Nub-X)  and 
the  other  bears  Cub  linked  to  both  Y  and  a  reporter 
protein  R  at  the  C  terminus  of  Cub  (Y-Cub-R).  The 
liberation  of  the  reporter  through  the  Ub-dependent 
cleavage  by  UBPs  indicates  the  in  vivo  reconstitution 
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of  a  quasi-native  Ub  from  Nub  and  Cub.  In  the  split-Ub 
assay,  the  efficiency  of  cleavage  at  the  C  terminus  of 
CLlb  in  Y-Cub-R  is  measured  relative  to  the  efficiency  of 
cleavage  observed  with  selected  reference  (control) 
proteins  (Figure  1). 

To  monitor  protein  interactions  during  translocation 
of  a  protein  across  the  ER  membrane,  Nub  was  fused  to 
the  N  terminus  of  a  membrane  protein  that  is  a  part  of 
the  translocation  machinery  (Figure  1).  Owing  to  the 
constraint  of  the  assay,  which  requires  the  cytosolic 
location  of  the  reconstituted  Ub,  the  N  terminus  of  this 
membrane  protein  must  be  located  in  the  celFs  cy¬ 
tosol.  Sec62p  has  an  N-terminal  cytosolic  domain  of 
158  residues,  which  is  followed  by  two  membrane- 
spanning  segments  and  a  C-terminal  segment  also 
facing  the  cytosol  (Deshaies  and  Schekman,  1990).  Nub 
was  therefore  fused  to  the  N  terminus  of  Sec62p, 
yielding  Nub-Sec62p.  Cub  was  sandwiched  between 
the  56  N-terminal  residues  of  the  precursor  of  S.  cer- 
evisiae  a-factor  pheromone  (prepro-a-factor)  and  the 
ha  epitope-tagged  mouse  dihydrofolate  reductase 
(DHFR-ha;  denoted  as  Dha)  as  a  reporter  protein, 
yielding  Mfa37-Cub-Dha  (Figure  2).  The  cleavage  of  the 
Cub-containing  fusion  at  the  Cub-Dha  junction  was 
detected  with  a  monoclonal  anti-ha  antibody. 


Split-Ub  Detects  a  Proximity  between  a 
Translocating  Protein  and  Sec62p 

We  first  verified  that  Mfa37-Cub-Dha  could  be  translo¬ 
cated  across  the  ER  membrane  and  that  the  N-terminal 
extension  of  Sec62p  with  Nub  did  not  interfere  with 
the  Sec62p  function  in  translocation.  After  a  5-min 
pulse  of  wild-type  S.  cerevisiae  with  35S-methionine, 
the  labeled  Mfa37-Cub-Dha  was  immunoprecipitated 
as  a  glycosylated  and  unclipped  fusion  (Figure  3A). 
Thus,  Mfa37-Cub-Dha  could  indeed  be  translocated 
into  the  lumen  of  ER.  Introduction  of  the  same  Mfa37- 
Cub-Dha  construct  into  the  yeast  strain  RSY529,  which 
carries  a  temperature-sensitive  (ts)  variant  of  Sec62p 
(Rothblatt  et  al,  1989),  confirmed  the  severe  transloca¬ 
tion  defect  of  this  strain.  About  50%  of  the  pulse- 
labeled  Mfa37-Cub-Dha  entered  the  lumen  of  the  ER  in 
this  strain  at  the  semipermissive  temperature  of  30°C, 
while  the  rest  remained  in  the  cytosol  (Figure  3A). 
Thus,  the  translocation  of  Mfa37-Cub-Dha  depends  on 
Sec62p.  This  made  it  possible  to  determine  whether 
Nub-Sec62p  is  functionally  active.  The  test  utilized 
Nug-Sec62p,  in  which  the  N-terminal  half  of  Ub  con¬ 
tained  Gly-13  instead  of  wild-type  Ile-13.  This  deriv¬ 
ative,  denoted  as  Nug,  has  a  lower  affinity  for  CLlb  than 
the  wild-type  Nub  (Johnsson  and  Varshavsky,  1994a). 
We  chose  Nug-Sec62p  for  this  experiment  to  minimize 
the  reconstitution  of  the  Ub  moiety  through  interac¬ 
tions  between  Nub-Sec62p  and  potentially  arrested 
molecules  of  Mfa37-Cub-Dha,  which  might  be  localized 
in  the  cytosol.  Plasmids  expressing  Nug-Sec62p  and 


EndoH:  —  +  —  +  —  + _ _ + 


a  b  c  d  e  f  g  h 


— 

Sec62p 

O 

NUb-Sec62p 

O 

Sec62(AQ0O)Dha 

NUb-Sec62(AQ6o)Dha 

Figure  3.  Sec62p  is  close  to  the  signal  sequence  of  the  a-factor 
precursor.  (A)  S.  cerevisiae  cells  expressing  Mfa37-Cub-Dha  (construct 
8;  Figure  2)  were  labeled  for  5  min  with  35S-methionine.  The  ex¬ 
tracted  proteins  were  immunoprecipitated  with  anti-ha  antibody, 
followed  by  a  mock  treatment  (lanes  a,  c,  e,  and  g)  or  the  treatment 
with  EndoH  (lanes  b,  d,  f,  and  h),  and  SDS-PAGE.  The  results  with 
cells  coexpressing  N ug-  or  Nub-Sec62p  are  shown  in  lanes  c  and  d 
and  g  and  h,  respectively.  The  analysis  was  performed  with  Nug- 
Sec62p  in  the  S.  cerevisiae  mutant  RSY529  carrying  a  ts  allele  of 
SEC62  (lanes  a-d)  or  with  Nub-Sec62p  in  the  wild-type  yeast  (lanes 
e-h).  Number  8  (following  the  numbering  of  the  constructs  in 
Figure  2)  on  the  right  indicates  the  positions  of  uncleaved  Mfa37- 
Cub-Dha  and  its  glycosylated  forms.  An  asterisk  denotes  an  unre¬ 
lated  yeast  protein  that  cross-reacts  with  the  anti-ha  antibody.  (B) 
Nub-Sec62p  encodes  a  functionally  active  protein.  RSY529  cells  car¬ 
rying  an  empty  plasmid  (a),  Sec62p  (b),  Nub-Sec62p  (c), 
Sec62(AC60)Dha  (d),  or  Nub-Sec62(AC60)Dha  (e)  were  spotted  on 
minimal  media  and  grown  for  2  d  at  30°C  (semipermissive  temper¬ 
ature  for  unmodified  RSY529). 


Mfa37-Cub-Dha  were  cotransformed  into  RSY529  cells 
and  assayed  at  30°C.  As  in  wild-type  cells,  only  trans¬ 
located  Mfa37-Cub-Dha,  but  virtually  no  free  Dha  or 
nontranslocated  Mfa37-Cub-Dha,  was  detected  after 
immunoprecipitation  and  EndoH  treatment  of  the 
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cells  that  had  been  labeled  for  5  min  with  35S-methi- 
onine  (Figure  3A). 

To  test  Nub-Sec62p  directly  (Nub  is  the  wild-type  half 
of  Ub,  containing  lie  at  position  13),  we  examined  its 
ability  to  complement  the  growth  defect  of  RSY529 
cells.  RSY529  cells  expressing  Nub-Sec62p  were  found 
to  grow  at  the  semipermissive  temperature  of  30°C,  in 
contrast  to  congenic  cells  carrying  a  control  plasmid 
(Figure  3B).  To  verify  that  the  suppression  of  the  ts 
phenotype  was  not  due  to  the  initiation  of  translation 
from  the  first  (internal)  ATG  codon  of  Sec62p  within 
the  Nub-Sec62p  fusion,  the  rescue  experiment  was  suc¬ 
cessfully  repeated  with  the  otherwise  identical  deriv¬ 
ative  of  Nub-Sec62p  that  lacked  the  first  ATG  of  SEC62 
(our  unpublished  results). 

A  significant  amount  of  free  Dha  was  generated 
when  Mfa37-Cub-Dha  was  expressed  (in  either  wild- 
type  or  RSY529  cells)  together  with  Nub-Sec62p,  which 
contained  the  wild-type  half  of  Ub  (Figure  3A  and  our 
unpublished  results).  We  concluded  that  Sec62p  is 
close  to  the  nascent  polypeptide  chain  during  its  trans¬ 
location  into  the  ER.  The  cleavage  at  the  C  terminus  of 
Cub  requires  its  interaction  with  Nub  and  depends  on 
the  presence  of  UBPs  (Johnsson  and  Varshavsky, 
1994a).  Since  UBPs  have  previously  been  shown  to  be 
absent  from  the  ER  (Johnsson  and  Varshavsky,  1994b), 
the  free  Dha  moiety  had  to  be  produced  in  the  cytosol. 
Fractionation  experiments  confirmed  that  free  Dha 
was  absent  from  membrane-enclosed  compartments 
in  whole-cell  extracts  (our  unpublished  results).  An 
entirely  independent  evidence  for  this  conclusion  was 
produced  by  replacing  Dha  in  Mfa37-Cub-Dha  with 
Ura3p  as  the  reporter  moiety.  Ura3p  confers  the  Ura+ 
phenotype  on  ura3A  cells  only  if  Ura3p  has  access  to 
the  cytosol  (Johnsson  and  Varshavsky,  1994b).  In  our 
tests,  the  cytosolic  Ura3p  was  produced  only  if  Mfn37- 
Cub-Ura3p  was  coexpressed  with  Nub-Sec62p  (com¬ 
pare  A  and  B  in  Figure  7),  in  agreement  with  the  other 
evidence  (see  above)  that  the  cleavage  at  the  Cub- 
protein  junction  takes  place  exclusively  in  the  cytosol. 

The  transient  nature  of  the  proximity  between 
Sec62p  and  the  nascent  chain  of  a  translocated  protein 
was  indicated  by  the  near-absence  of  the  released  Dha 
moiety  if  Mfa37-Cub~Dha  was  coexpressed  with  either 
Nug-Sec62p  or  Nua-Sec62p  instead  of  Nub-Sec62p  (Nua 
denotes  Ala  at  position  13  of  Nub);  by  contrast,  the 
same  experiment  with  Nub-Sec62p  resulted  in  a  signif¬ 
icant  cleavage  of  Mfa37-Cub-Dha  (Figures  3 A  and  6C). 
Previous  work  (Johnsson  and  Varshavsky,  1994a)  has 
shown  that  Nua  and  Nug  can  induce  significant  Ub 
reconstitution  when  either  of  them  and  Cub  are  linked 
to  polypeptides  that  form  a  stable  (long-lived)  com¬ 
plex  in  a  cell.  In  summary,  the  observed  absence  of 
significant  Ub  reconstitution  with  Nua  and  Nug  (in 
contrast  to  Nub)  was  interpreted  to  signify  a  close  but 
transient  (short-lived)  proximity  between  Sec62p  and 
Mfa37-Cub“Dha. 


Specificity  of  the  Spatial  Proximity  between  a 
Signal  Sequence-bearing  Nascent  Polypeptide  and 
Sec62p 

A  commonly  used  negative  control  in  a  translocation 
assay  is  a  protein  with  a  defective  or  absent  signal 
sequence  (Allison  and  Young,  1988;  Miisch  et  al , 
1992).  Such  a  control  is  not  entirely  compatible  with 
spatio-temporal  aspects  of  the  split-Ub  assay.  Specifi¬ 
cally,  a  Cub-fusion  protein  lacking  a  signal  sequence 
accumulates  in  the  cytosol  (where  the  split-Ub  assay 
operates),  whereas  an  analogous  signal  sequence¬ 
bearing  protein  is  continuously  removed  from  this 
compartment.  A  direct  comparison  between  reactions 
that  involve  a  signal  sequence-bearing  polypeptide 
and  its  signal  sequence-lacking  counterpart  requires 
the  ability  to  compare  the  local  concentrations  of  the 
two  polypeptides  at  the  site  of  translocation.  We  are 
not  aware  of  an  in  vivo  technique  that  would  be 
independent  of  the  split-Ub  assay  and  at  the  same 
time  would  allow  a  measurement  of  these  parameters. 
Therefore,  we  devised  an  alternative  control.  The  ex¬ 
tent  of  cleavage  of  Mfa37-Cub-Dha  at  the  Cub-Dha 
junction  should  reflect  the  time-averaged  spatial  prox¬ 
imity  between  the  nascent  Mfa-chain  and  a  coex¬ 
pressed  Nub-containing  fusion.  By  comparing  the  ex¬ 
tent  of  cleavage  of  Mfa37-Cub-Dha  in  the  presence  of 
Nub-Sec62p  (Figure  3A)  with  the  analogous  activity  of 
Nub-fusion  proteins  that  are  not  involved  in  the  ER 
targeting  and  translocation,  we  could  assess  the  spec¬ 
ificity  of  the  reaction  between  Nub-Sec62p  and  Mfa37- 
Cub-Dha. 

Four  Nub-fusion  proteins,  Nub-Boslp,  Nub-Stel4p, 
Nub-Sed5p,  and  Nub-Tpilp  were  tested  in  the  split-Ub 
assay  with  Mfn37-Cub-Dha.  The  expected  intracellular 
locations  of  these  Nub  fusions,  and  their  predicted 
topologies  in  the  membrane  are  shown  in  Figure  2A 
(Shim  et  al ,  1991;  Banfield  et  al. ,  1994;  Sapperstein  et 
al. ,  1994;  Lewke  and  Johnsson,  unpublished  data).  We 
found  that,  in  contrast  to  Nub-Sec62p,  none  of  the  four 
tested  Nub  fusions  induced  a  significant  cleavage  of 
Mfa37-Cub-Dha  (Figure  4A).  A  small  amount  of  free 
Dha  could  be  detected  in  the  immunoblots  when  Nub- 
Boslp  was  overexpressed.  The  lack  of  significant  Ub 
reconstitution  from  Nub  and  Cub  upon  coexpression  of 
Mfa37-Cub-Dha  and  the  Nub-modified  ER  membrane 
proteins,  Boslp,  Stel4p  (Figure  4A),  and  Secl2p  (Na- 
kano  et  al,  1988;  our  unpublished  results),  confirmed 
that  the  steady-state  concentration  of  Mfa37-Cub-Dha 
in  the  cytosol  was  extremely  low. 

To  verify  that  the  observed  absence  of  Ub  reconsti¬ 
tution  (Figure  4A)  was  not  due  to  either  low  concen¬ 
trations  of  the  tested  fusion  proteins  or  reduced  acces¬ 
sibility  of  their  linked  Nub  moieties,  we  compared  the 
activity  of  these  Nub  fusions  toward  a  cytosolic  ef¬ 
fusion  protein.  Cub-Dha  was  fused  to  the  C  terminus 
of  the  cytosolic  enzyme  triosephosphate  isomerase 
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Figure  4.  The  in  vivo  proxim¬ 
ity  between  Sec62p  and  Mfa37- 
Cub-Dha  is  transient  and  spe¬ 
cific.  (A)  Immunoblot  analysis 
of  extracts  of  S.  cerevisiae  coex¬ 
pressing  the  Mfa37-Cub-Dha 
(construct  8;  Figure  2)  and  one 
of  the  following  constructs: 
Nub-Sec62p,  integrated  (lane  a) 
or  plasmid -borne  (lane  b);  Nub- 
Boslp  (lane  c);  Nub-Stel4p  (lane 
d);  Nub-Sed5p(lane  e);  and  Nub- 
Tpilp  (lane  f).  (B)  Immunoblot 
analysis  of  extracts  of  S.  cerevi¬ 
siae  expressing  the  Tpilp-Cub- 
Dha  fusion  (construct  14;  Figure 
2)  alone  (lane  a)  or  together 
with  one  of  the  following  con¬ 
structs:  Nllb-Sec62p/  either  inte¬ 
grated  (lane  b)  or  plasmid- 
borne  (lane  c);  Nub-Boslp  (lane 
d);  Nub-Stel4p  (lane  e);  Ntlb- 
Sed5p(lane  f);  and  NLlb-Tpilp 
(lane  g).  Number  14  on  the  left 
indicates  the  position  of  un¬ 
cleaved  Tpilp-Cub-Dha.  (C)  S. 
cerevisiae  cells  expressing 
Mfa37-Cllb-Dha  (construct  8; 
Figure  2)  together  with  either 
the  vector  (lane  a),  Nub-Boslp 
(lane  b),  or  Nub-Sec62p  (lane  c) 
were  labeled  for  5  min  with  35S- 
methionine.  The  extracted  pro¬ 
teins  were  immunoprecipitated 
with  anti-ha  antibody  and  ana¬ 
lyzed  by  SDS-PAGE.  (D)  Quan¬ 
titation  of  the  pulse-labeling  ex¬ 
periment  (C)  using  Phosphor- 
Imager.  The  extent  of  Dha  re¬ 
lease  in  the  presence  of  Nub- 
Sec62p  was  arbitrarily  set  at 
100.  The  averages  of  three  ex¬ 
periments  are  shown.  Lanes  a. 


abcdef  abcdefg 


a  b  C  a  b  C  chase  time  (min) 


b,  and  c  are  the  same  as  in  panel 

C.  (E)  S.  cerevisiae  cells  expressing  Mfa37-Cub-Dha  together  with  Nub-Sec62p  were  labeled  for  5  min  with  35S-methionine  and  chased  for 
5  and  15  min,  followed  by  extraction  of  proteins,  immunoprecipitation  with  anti-ha  antibody,  and  SDS-PAGE. 


(Tpilp),  yielding  Tpil-Cub-Dha  (Figure  2B).  All  of  the 
Nub-fusion  proteins  in  Figure  2A  induced  a  significant 
release  of  Dha  from  the  test  protein  Tpil-Cub-Dha 
(Figure  4B).  This  analysis  also  suggested  that  Nub- 
Boslp  was  expressed  to  higher  levels  than  other  Nub 
fusions. 

To  quantify  the  relative  proximities  of  Nub-Boslp 
and  Nub-Sec62p  to  Mfa37-Cub-Dha,  yeast  cells  were 
labeled  for  5  min  with  35S-methionine/  and  the  re¬ 
leased  Dha  was  determined  as  described  in  the  legend 
to  Figure  4.  Coexpression  of  Nub-Sec62p  and  Mfa37- 
Cub-Dha  yielded  —15  times  more  of  the  free  Dha  than 
coexpression  of  Nub-Boslp  and  Mfa37-Cub-Dha  (Fig¬ 
ure  4,  C  and  D).  Assuming  that  the  Nub  moieties  in 
Nub-Sec62p  and  Nub-Boslp  were  equally  accessible  to 
the  cytosol  (Figure  4B),  we  concluded  that  the  time- 
averaged  proximity  between  the  nascent  chain  of 


Mfa37-Cub-Dha  and  the  Nub-bearing  transmembrane 
proteins  was  much  higher  for  Sec62p  than  for  the  ER 
membrane  proteins  that  are  not  involved  in  targeting 
or  translocation.  Note  that  this  analysis  may  actually 
underestimate  the  proximity  of  Sec62p  to  the  nascent 
chain,  because  we  invariably  observed  a  more  efficient 
cleavage  of  Mfn37-Cub-Dha  when  Nub-Sec62p  was  the 
only  form  of  Sec62p  in  the  cell  (Figures  4 A  and  6A). 
Therefore  we  interpret  the  reduced  cleavage  of  Mfa37- 
Cub-Dha  in  the  presence  of  both  Nub-Sec62p  and  the 
native  Sec62p  as  the  consequence  of  competition  be¬ 
tween  those  two  Sec62p-containing  species  for  either 
the  signal  sequences  of  translocated  proteins  or  the 
ligands  of  Sec62p  in  the  complex  of  Sec  proteins. 

Recent  evidence  indicates  that  misfolded  or  other¬ 
wise  abnormal  proteins  in  the  lumen  of  the  ER  can  be 
retrotransported  across  the  ER  membrane  back  into 
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the  cytosol,  where  they  are  degraded  by  the  Ub  system 
(Biederer  et  al,  1996;  Hiller  et  al,  1996;  Wiertz  et  al, 
1996).  This  retro  transport  involves  at  least  some  of  the 
components  of  the  known  ER  translocation  machinery 
(Plemper  et  al,  1997).  To  determine  whether  the  cleav¬ 
age  of  Mfa37-Cub-Dha  at  the  Cub-Dha  junction  occurs 


Figure  5.  The  nature  of  the  sig¬ 
nal  sequence  and  its  distance 
from  Cub  determine  the  extent  of 
cleavage  of  Cub-R  in  the  presence 
of  Nub-Sec62p.  (A)  S.  cerevisiae  ex¬ 
pressing  Mfa37-Cub-Dha  (con¬ 
struct  8;  Figure  2)  (lanes  a  and  b), 
Mfa65-Cub-Dha  (construct  9) 
(lanes  c  and  d),  Suc223-Cub-Dha 
(construct  10)  (lanes  e  and  f), 
Suc233-Cub-Dha  (construct  11) 
(lanes  g  and  h),  Suc259-Cub-Dha 
(construct  12)  (lanes  i  and  j)  and 
Suc2518-Cub-Dha  (construct  13) 
(lanes  k  and  1)  were  labeled  with 
35S-methionine  for  5  min.  The  ex¬ 
tracted  proteins  were  either 
mock-treated  (lanes  a,  c,  e,  g,  i, 
and  k)  or  treated  with  EndoH 
(lanes  b,  d,  f,  h,  j,  and  1),  followed 
by  immunoprecipitation  with  an- 
ti-ha  antibody  and  SDS-PAGE. 

(B)  Same  as  panel  A  but  the  cells 
also  contained  Nub-Sec62p  in  ad¬ 
dition  to  the  Cub-fusions  Mfa37- 
Cub-Dha,  Mfa65-Cub-Dha,  Suc223- 
Cub-Dha,  Suc233-Cub-Dha,  Suc259- 
Cub-Dha,  Suc2518-Cub-Dha  (lanes 
a-f).  The  analysis  was  carried  out 
by  immunoblotting  whole-  cell 
extracts  with  the  anti-ha  antibody. 

(C)  S.  cerevisiae  cells  expressing 
Suc223-Cub-Dha  (construct  10; 
Figure  2)  (lanes  a-c)  and  Suc259- 
Cub-Dha  (construct  12;  Figure  2) 
(lanes  d-f)  together  with  either 
Nub-Sec62p  (lanes  b  and  e),  Nub- 
Boslp  (lanes  c  and  f)  or  the  vector 
(lanes  a  and  d)  were  labeled  for  5 
min  with  35S-methionine.  Whole- 
cell  extracts  were  immunoprecipi- 
tated  with  anti-ha  antibody,  fol¬ 
lowed  by  SDS-PAGE  and 
autoradiography.  (D)  S.  cerevisiae 
cells  expressing  ASucZ^-C^-Dha 
(construct  15;  Figure  2)  or 
ASuc259-Cub-Dha  (construct  16; 
Figure  2)  together  with  either  the 
vector  (first  six  lanes)  or  Nub- 
Sec62p  (last  six  lanes)  were  la¬ 
beled  for  5  min  with  35S-methio- 
nine  and  chased  for  10  and  30 
min,  followed  by  extraction,  im¬ 
munoprecipitation  with  anti-ha 
antibody,  and  SDS-PAGE.  Num¬ 
bers  15  and  16  indicate  the  posi¬ 
tions  of  the  corresponding  (un¬ 
cleaved)  Cub  fusions. 


during  translocation  into  the  ER  or  during  (in  this 
case)  a  hypothetical  retrotransport  from  the  ER,  cells 
coexpressing  Nub-Sec62p  and  Mfn37-Cub-Dha  were  la¬ 
beled  for  5  min  with  35S-methionine,  and  then  chased 
for  15  min  (Figure  4E).  Although  the  translocated 
Mfa37-Cub-Dha  disappeared  rapidly  during  the  chase. 


336 


Molecular  Biology  of  the  Cell 


Interaction  between  Signal  Sequence  and  Sec62p 


the  amount  of  free  Dha  that  accumulated  during  the 
pulse  remained  constant. 

We  conclude  that  the  in  vivo  proximity  between 
Sec62p  and  Mfa37-Cub-Dha  that  is  detected  by  the 
split-Ub  assay  occurs  either  during  or  very  shortly 
after  the  synthesis  of  Mfa37-Cub-Dha.  The  apparent 
disappearance  of  the  pulse-labeled,  translocated 
Mfa37-Cub-Dha  during  the  chase  accounts  for  the  dif¬ 
ficulty  in  detecting  this  species  by  a  steady-state  assay 
such  as  immunoblotting  (Figures  4A,  5B,  and  6A).  The 
likely  cause  of  the  disappearance  of  translocated 
Mfa37-Cub-Dha  is  its  molecular  mass  heterogeneity, 
owing  to  its  glycosylation,  which  results  in  a  smear 
upon  SDS-PAGE  (Figures  3A,  4C,  and  4E). 

The  Efficiency  of  Ub  Reconstitution  Mediated  by 
Nub-Sec62p  Depends  on  Both  the  Identity  of  a  Signal 
Sequence  and  the  Position  of  Cub  in  the  Nascent 
Polypeptide  Chain 

The  proximity  of  Sec62p  to  the  signal  sequence  of 
Mfn37-Cub-Dha  is  detected,  in  the  split-Ub  assay, 
through  the  ability  of  Nub-Sec62p  to  induce  the  cleav¬ 
age  of  Mfa37-Cub-Dha  at  the  Cub-Dha  junction  (Figure 
3A).  If  this  cleavage  reflects  the  physical  proximity 
between  Sec62p  and  a  signal  sequence,  the  efficiency 
of  cleavage  should  decrease  if  the  Cub  moiety  is  moved 
closer  to  the  C  terminus  of  the  nascent  polypeptide 
chain.  However,  this  purely  spatial  consideration  ne¬ 
glects  the  temporal  aspect  of  the  translocation  process 
(Walter  and  Johnson,  1994).  The  targeting  and  the 
actual  translocation  are  initiated  during  or  shortly  af¬ 
ter  the  synthesis  of  a  signal  sequence-bearing  protein. 
Consequently,  the  C-terminal  parts  of  the  nascent 
chain  may  still  be  synthesized,  or  at  least  associated 
with  the  ribosome,  at  the  time  when  Sec62p  and  the 
signal  sequence  have  already  become  spatially  close. 
Extending  the  spacer  would  increase  the  distance  be¬ 
tween  CLlb  and  the  signal  sequence  of  Mfa37-Cub-Dha. 
This  would  be  expected  to  decrease  the  time  window 
available  for  the  interaction  between  the  Cub  moiety  of 
Mfn37-Cub-Dha  and  the  Nub  moiety  of  Nub-Sec62p. 
Therefore,  a  test  of  this  kind  cannot  deconvolute  the 
contribution  of  each  of  the  two  parameters  (increased 
spatial  distance  along  the  chain  between  Sec62p  and 
Cub  and  decreased  time  window  for  the  Nub-Cub  in¬ 
teraction)  to  the  overall  effect  of  extending  the  length 
of  the  polypeptide  between  the  signal  sequence  and 
Cub.  These  constraints  notwithstanding,  moving  the 
Cub  moiety  of  Mfa37-Cub-Dha  further  away  from  its 
signal  sequence  makes  it  possible  to  gauge  the  acces¬ 
sibility  of  Sec62p  to  specific  regions  of  the  nascent 
polypeptide  chain  in  vivo. 

In  the  actual  experiment,  the  distance  between  the 
signal  sequence  of  Mfa37-Cub-Dha  and  its  Cub  moiety 
was  increased  from  37  to  65  residues  (Mfn65-Cub-Dha; 
Figure  2B,  construct  9).  The  results  of  EndoH  treat¬ 


ment  of  Mfa65-Cub-Dha  immunoprecipitated  from 
pulse-labeled  wild-type  cells  confirmed  that  Mfn65- 
Cub-Dha  was  efficiently  translocated  into  the  ER  (Fig¬ 
ure  5A).  However,  the  efficiency  of  the  Dha-yielding 
cleavage  of  Mfa65-Cub-Dha  upon  coexpression  of  Nub- 
Sec62p  was  clearly  reduced  in  comparison  to  the  same 
cleavage  with  Mfa37-Cub-Dha  and  Nub-Sec62p  (Figure 
5B). 

Both  the  kinetics  and  the  mode  of  targeting  for 
translocation  are  influenced  by  the  identity  of  a  signal 
sequence  (Bird  et  al,  1987;  Johnsson  and  Varshavsky, 
1994b;  Ng  et  al,  1996).  For  example,  the  efficient  trans¬ 
location  of  invertase  (Suc2p)  requires  the  SRP,  in  con¬ 
trast  to  a  much  weaker  requirement  for  SRP  in  the  case 
of  the  prepro-a-factor's  signal  sequence  (Hann  and 
Walter,  1991;  Ogg  et  al,  1992;  Johnsson  and  Var¬ 
shavsky,  1994b).  Consequently,  the  coupling  between 
translation  and  translocation  is  tighter  for  proteins 
bearing  the  invertase  signal  sequence  than  for  proteins 
carrying  the  signal  sequence  of  the  a-factor. 

We  assessed  the  in  vivo  proximity  of  the  invertase 
signal  sequence  to  Sec62p  by  measuring  the  reconsti¬ 
tution  of  Ub  from  Nub-Sec62p  and  Suc2-Cub-Dha, 
where  the  Suc2p  moiety  was  linked  to  Cub  through  a 
spacer  of  increasing  length  (Figure  2B).  The  expression 
and  efficient  translocation  of  different  Suc2-Cub-Dha 
constructs  were  assayed  by  immunoprecipitation  and 
subsequent  EndoH  treatment  (Figure  5A).  The  prox¬ 
imity  of  Cub  in  Suc2-Cub-Dha  to  Nub  of  Nub-Sec62p 
was  assayed  by  immunoblot  detection  of  the  cleavage- 
derived  free  Dha  in  whole-cell  extracts.  The  pattern 
already  observed  for  the  Mfa-Cub-Dha  constructs  re¬ 
curred  with  the  constructs  bearing  the  invertase  signal 
sequence  (Figure  5B).  Moreover,  coexpression  of  Nub- 
Sec62p  with  either  Suc223-Cub-Dha  or  Suc233-Cub-Dha 
yielded  lower  amounts  of  free  Dha  than  the  analogous 
assays  with  Nub-Sec62p  and  Mfa37-Cub-Dha,  which 
bears  a  spacer  of  comparable  length  (Figure  5B). 

Pulse-chase  analyses  with  cells  expressing  Nub- 
Sec62p  (or  Nub-Boslp)  and  either  Suc223-Cub-Dha  or 
Suc259-Cub-Dha  confirmed  the  immunoblot  data.  Spe¬ 
cifically,  a  significant  release  of  free  Dha  was  observed 
only  for  the  pair  of  Nub-Sec62p  and  Suc223-Cub-Dha 
(Figure  5C).  Our  previous  work  has  shown  that  the 
segment  of  the  nascent  polypeptide  chain  where  the 
Cub  moiety  was  inserted  in  either  the  Suc223-Cub-Dha 
or  the  Mfa37-Cub-Dha  fusion  is  transiently  exposed  to 
the  cytosol — until  the  initiation  of  ER  translocation 
(Johnsson  and  Varshavsky,  1994b).  Therefore,  we 
compared  the  ratios  of  cleaved  to  uncleaved  Suc223- 
Cub-Dha  and  Mfn37-Cub-Dha.  Cells  expressing  Nub- 
Sec62p  and  either  the  Cub  fusion  8  or  10  (Figure  2B) 
were  labeled  for  5  min  with  35S-methionine  and  pro¬ 
cessed  for  immunoprecipitation  with  anti-ha  antibody, 
followed  by  determination  of  the  cleaved-to-uncleaved 
ratio  (Figure  6D).  This  ratio,  a  measure  of  the  time- 
averaged  proximity  of  Sec62p  to  a  translocating  protein, 
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(black  bars:  b,  d,  f,  and  h)  Cub  fusions.  The  sum  of  a  cleaved  and  uncleaved  fusion  was  set  at  100  in  each  of  the  three 
also  indicated. 


Figure  6.  Sec61p,  but  not  a  mu¬ 
tant  of  Sec62p,  are  close  to  the  nas¬ 
cent  chain  of  a  translocated  pro¬ 
tein.  (A)  These  assays  employed  S. 
cerevisiae  expressing  Mfa37rCub- 
Dha  (construct  8,  Figure  2)  and 
one  of  the  following  Nub  fusions 
(Figure  2):  Nub-Sec62p,  either  inte¬ 
grated  (lane  a)  or  plasmid  borne 
(lane  b);  Nub-Sec62(AC60)Dha 
(lane  c);  and  Nub-Sec61p  (lane  d). 
Whole-cell  extracts  from  these 
strains  were  subjected  to  immuno- 
blot  analysis  with  anti-ha  anti¬ 
body.  (B)  Same  as  panel  A  but  the 
same  Nub  fusions  were  coex¬ 
pressed  with  Tpil-Cub-Dha  (con¬ 
struct  14;  Figure  2).  Numbers  2 
and  14  indicate  the  positions  of  the 
corresponding  (uncleaved)  fu¬ 
sions.  (C)  Lane  a:  S.  cerevisiae  ex¬ 
pressing  Suc223-Cub-Dha  (con¬ 
struct  10;  Figure  2)  together  with 
either  Nub-Sec61p,  Nua-Sec61p, 
Nug-Sec61p,  Nub-Sec62p,  Nua- 
Sec62p,  or  Nug-Sec62p;  lane  b; 
same  as  lane  a  but  cells  expressed 
Mfa37Uub-Dha  (construct  8;  Fig¬ 
ure  2)  instead  of  Suc223~Cub-Dha. 
(D)  S.  cerevisiae  cells  expressing 
Mfa37-Cub-Dha  together  with 
Nub-Sec61p  (a  and  b)  or  Nub- 
Sec62p  (e  and  f),  and  cells  express¬ 
ing  Suc223-Cub-Dha  together  with 
Nub-Sec61p  (c  and  d)  or  Nub- 
Sec62p  (g  and  h)  were  labeled  for  5 
min  with  35S-methionine.  Whole¬ 
cell  extracts  were  immunoprecipi- 
tated  with  anti-ha  antibody,  fol¬ 
lowed  by  SDS-PAGE,  and 
quantitation  of  the  cleaved  and 
uncleaved  Cub  fusions  using  Phos- 
phorlmager.  Shown  are  the  rela¬ 
tive  amounts  of  the  cleaved  (white 
bars:  a,  c,  e,  and  g)  and  uncleaved 
independent  experiments.  SDs  are 


was  —eightfold  higher  for  a  nascent  polypeptide  bearing 
the  signal  sequence  of  a-factor  than  for  a  nascent 
polypeptide  bearing  the  invertase  signal  sequence  (Fig¬ 
ure  6D;  compare  Figures  4C  and  5C). 

Spacer  sequences  of  different  length  or  composition 
upstream  of  the  Cub  moiety  might  nonspecifically  in¬ 
fluence  the  interaction  between  Nub  and  Cub.  To  assess 
this  potential  spacer  effect,  we  constructed  signal  se¬ 
quence-lacking  versions  of  Suc223-Cub-Dha  and 
Suc259-Cub-Dha  (Figure  2B,  Cub  fusions  15  and  16),  and 
compared  their  ability  to  reconstitute  Ub  in  the  pres¬ 
ence  of  coexpressed  Nub-Sec62p.  Both  of  these  Cub 
fusions  were  cleaved  at  the  Cub-Dha  junction  at  approx¬ 
imately  the  same  rate  in  the  presence  of  Nub-Sec62p 
(Figure  5D),  in  contrast  to  the  marked  difference  in  the 
rate  of  cleavage  observed  for  their  signal  sequence-bear¬ 
ing  counterparts  (Figure  5,  B  and  C).  This  control  exper¬ 


iment  further  emphasized  the  effect  of  distance  between 
a  signal  sequence  and  the  Cub  moiety  on  the  efficiency  of 
Ub  reconstitution  in  the  presence  of  Nub-Sec62p.  We 
conclude  that  the  accessibility  of  Sec62p  in  vivo  to  a 
specific  region  of  the  nascent  polypeptide  chain  is  influ¬ 
enced  by  both  the  nature  of  a  signal  sequence  and  its 
distance  from  that  region. 

Sec61p  Is  Equidistant  from  Two  Different  Signal 
Sequences 

A  direct  comparison  between  two  different  signal  se¬ 
quences  upstream  of  the  Cub  moiety  presumes  ap¬ 
proximately  equal  residence  times  of  the  correspond¬ 
ing  Cub  moieties  in  the  cytosol.  It  is  also  essential  to 
know  that  the  influence  of  the  identity  of  a  signal 
sequence  on  the  rate  of  Ub  reconstitution  is  not  due  to 
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a  nonspecific  intramolecular  interaction.  One  way  to 
address  these  issues  involves  measuring  the  reconsti¬ 
tution  of  Ub  from  the  Cub  moieties  of  the  fusions  8  and 
10  (Figure  2B)  and  a  Nub-containing  fusion  that  is  not 
involved  in  translocation.  As  illustrated  in  Figure  4, 
this  test  is  not  feasible  because  of  the  rapid  transloca¬ 
tion  of  Cub-containing  constructs  into  the  ER.  Note, 
however,  that  since  Sec61p  is  the  central  component  of 
the  translocation  pore,  proteins  that  utilize  different 
targeting  pathways  will  converge  at  Sec61p  shortly 
before  their  translocation  (Jungnickel  and  Rapoport, 

1995) .  Taking  advantage  of  this  property  of  Sec61p,  we 
assayed  the  proximity  of  Mfa37-Cub-Dha  and  Suc223- 
Cub-Dha  to  Nub-Sec61p.  If  Mfa37-Cub-Dha  and  Suc223- 
Cub-Dha  are  cleaved  at  the  Cub-Dha  junction  equally 
well  in  the  presence  of  Nub-Sec61p,  the  above  inter¬ 
pretation  of  the  observed  selectivity  of  Nub-Sec62p 
toward  Mfa37-Cub-Dha  (Figures  5  and  6)  would  be 
confirmed. 

To  carry  out  this  test,  Nub  was  fused  to  the  cytosolic 
N  terminus  of  Sec61p  (Figure  2A)  (Wilkinson  et  ah, 

1996) .  Nub-Sec61p  is  functionally  active  (Wittke  and 
Johnsson,  unpublished  data).  It  induced  the  release  of 
free  Dha  from  of  Mfa37-Cub-Dha  and  Tpilp-Cub-Dha 
with  efficiency  similar  to  that  of  Nub-Sec62p  (Figure  6, 
A  and  B).  Thus,  the  split-Ub  assay  independently  con¬ 
firmed  that  Sec61p  is  close  to  the  nascent  polypeptide 
chain  during  its  translocation.  To  compare  the  in  vivo 
interactions  of  Sec61p  with  the  Cub  fusions  8  and  10, 
which  bore  different  signal  sequences  (Figure  2B),  the 
amount  of  free  Dha  was  determined  by  immunoblot- 
ting  of  whole-cell  extracts.  It  was  found  that  in  the 
presence  of  Nub-Sec61p,  similar  amounts  of  Dha  were 
released  from  Mfn37-Cub-Dha  and  Suc223-Cub-Dha, 
whereas  in  the  presence  of  Nub-Sec62p  twice  as  much 
Dha  was  released  from  Mfn37-Cub-Dha  than  from 
Suc223-Cub-Dha  (Figure  6C). 

This  result  was  confirmed  and  extended  by  label¬ 
ing  the  cotransformed  cells  for  5  min  with  35S-me- 
thionine  and  quantifying  the  ratio  of  cleaved-to- 
uncleaved  Cub  fusions  (Figure  6D).  As  was  already 
observed  by  the  immunoblot  analysis,  the  above 
ratio  was  ~1  for  both  Mfa37-Cub-Dha  and  Suc223- 
Cub-Dha  in  the  presence  of  Nub-Sec61p,  but  ~2  for 
Mfa37-Cub-Dha,  and  —0.25  for  Suc223-Cub-Dha  in  the 
presence  of  Nub-Sec62p  (Figure  6D).  The  difference 
revealed  by  the  pulse-immunoprecipitation  analysis 
is  higher  than  the  estimate  obtained  by  the  immu¬ 
noblot  analysis,  most  likely  because  of  the  continu¬ 
ous  accumulation  of  cleaved  (and  long-lived)  Dha 
before  the  processing  of  cells  for  immunoblotting. 

A  C-terminally  Truncated  Sec62p  Is  No  Longer 
Proximal  to  the  Signal  Sequence 

Does  the  proximity  of  Sec62p  to  a  nascent  polypeptide 
chain  that  is  detected  by  the  split-Ub  assay  reflect  the 


physical  binding  of  the  signal  sequence  to  this  pro¬ 
tein?  We  constructed  a  derivative  of  Nub-Sec62p  in 
which  the  C-terminal  60  residues  of  Sec62p  were  re¬ 
placed  by  the  DHFR-ha  (Dha)  moiety,  yielding 
Sec62(AC60)-Dha.  A  similar  Sec62p-invertase  fusion 
was  described  by  Deshaies  and  Schekman  (1990)  and 
shown  to  be  nonfunctional.  As  expected,  neither 
Sec62(AC60)~Dha  nor  Nub-Sec62(AC60)-Dha  comple¬ 
mented  the  ts  phenotype  of  RSY529  cells  (Figure  3B). 

The  Ub-reconstitution  activity  of  Nub-Sec62(AC60 )- 
Dha  in  the  presence  of  either  Mfa37-Cub-Dha  or  Tpil- 
Cub-Dha  (Figure  6,  A  and  B)  was  compared  with  the 
activity  of  Nub-Sec62p  and  Nub-Sec61p  in  the  presence 
of  the  same  Cub-containing  fusions.  Remarkably,  no 
cleavage  of  Mfa37-Cub-Dha  was  observed  in  the  pres¬ 
ence  of  Nub-Sec62(AC60)-Dha,  whereas  the  cytosolic 
Tpil-Cub-Dha  was  cleaved.  This  result  (Figure  6,  A 
and  B)  indicated  that  the  concentration  and  accessibil¬ 
ity  of  Nub  were  comparable  for  the  functionally  inac¬ 
tive  Nub-Sec62(AC60)-Dha  and  the  functionally  active 
Nub-Sec62p.  In  these  experiments,  Nub-Sec62(AC60)- 
Dha,  which  could  be  detected  with  the  anti-ha  anti¬ 
body  (Figure  6,  A  and  B),  was  expressed  from  the 
uninduced  PCUpi  promoter.  Strikingly,  even  overex¬ 
pression  of  Nub-Sec62(AC60)-Dha,  from  the  copper- 
induced  Pcupu  did  not  result  in  a  significant  cleavage 
of  Mfa37-Cub-Dha  (our  unpublished  results).  These 
control  experiments  with  the  inactive  derivative  of 
Sec62p  indicated  that  the  proximity  signal  in  the 
split-Ub  assay  with  Sec62p  requires  the  functional  ac¬ 
tivity  of  Sec62p. 

Using  Ura3p  Reporter  to  Detect  the  In  Vivo 
Proximity  between  Sec62p  and  Signal  Sequences 

The  DFIFR-ha  (Dha)  reporter  moiety  of  Mfa37-Cub- 
Dha  was  replaced  by  S.  cerevisiae  Ura3p  (orotidine-5'- 
phosphate  decarboxylase),  yielding  Mfn37-  Cub-Ura3p. 
The  use  of  cytosolic  Ura3p  as  a  reporter  for  transloca¬ 
tion  across  membranes  is  well  documented  (Maarse  et 
al,  1992;  Johnsson  and  Varshavsky,  1994b;  Ng  et  ah, 
1996).  The  high  sensitivity  of  Ura3p-based  assays  (cells 
become  Ura+  if  a  threshold  amount  of  Ura3p  is 
present  in  the  cytosol)  allowed  us  to  express  the  Nub 
and  Cub  fusions  from  the  uninduced  PCUP1  promoter. 
Since  the  efficient  translocation  of  Mfa37-Cub-Ura3p 
sequesters  the  Ura3p  activity  in  the  ER,  a  ura3A  strain 
of  S.  cerevisiae  that  expressed  Mfa37”Cub-Ura3p  re¬ 
mained  Ura-  (Figure  7A).  Nub-Sec62p,  which,  as 
shown  above,  is  close  to  the  nascent  chain  of  Mfa37- 
Cub-Dha  during  its  translocation,  induced  enough 
cleavage  of  Mfn37-Cub-Ura3p  at  the  Cub-Ura3p  junc¬ 
tion  to  render  cells  Ura+  (Figure  7B).  Cells  were  trans¬ 
formed  with  either  Nub-Sec62p,  Nub-Sec61p,  Nub- 
Sec62(AC60)-Dha,  or  Nub-Boslp  to  compare  relative 
proximities  of  these  Nub-containing  proteins  to  Cub 
fusions  bearing  the  Ura3p  reporter  moiety  and  either 
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Figure  7.  The  use  of  a  metabolic  marker  to  assess  the  proximity 
between  a  component  of  the  translocation  machinery  and  a  trans¬ 
located  protein.  S.  cerevisiae  expressing  the  Cub  fusions  8-12  (Figure 
2)  that  contained  Ura3p  instead  of  Dha  (see  the  main  text)  were 
transformed  with  the  vector  (A)  or  plasmids  expressing  Nub-Sec62p 
(B),  Nub-Sec61p  (C),  Nub-Sec62(AC60)Dha  (D),  and  Nub-Boslp  (E). 
Cells  were  grown  in  a  liquid  uracil-containing  SD  medium,  and 
~105,  103,  and  102  cells  were  spotted  onto  uracil-lacking  SD  me¬ 
dium.  Plates  were  examined  after  18  h  at  30°C. 


the  invertase-derived  or  the  a-factor-derived  signal 
sequence  (Figure  7,  B-E).  The  cells  were  spotted  on 
plates  lacking  uracil  and  incubated  at  30°C  for  18  h. 
The  growth  patterns  of  strains  that  expressed  different 
combinations  of  Nub-  and  Cub-containing  fusions  con¬ 
firmed  the  results  of  analyses  with  analogous  (but 
more  highly  expressed)  Dha-based  constructs. 

In  particular,  the  interaction  of  Sec62p  with  the  sig¬ 
nal  sequence  of  prepro-a-factor  was  stronger  than 
with  the  signal  sequence  of  invertase.  This  proximity 
was  not  detectable  when  the  distance  between  a  signal 
sequence  and  the  Cub  moiety  of  a  fusion  was  increased 
(Figure  7B).  Sec61p  appears  to  be  equally  close  to  both 
of  the  signal  sequences  tested.  Again,  the  proximity 
signal  was  gradually  lost  when  the  distance  between 
the  signal  sequence  and  the  Cub  moiety  was  increased 
(Figure  7C).  Cells  acquired  a  weak  Ura+  phenotype  in 
the  presence  of  Nub-Boslp  and  the  Cub-Ura3p  fusions 


8  and  10  (Figure  7E).  If  used  as  a  reference  to  discrim¬ 
inate  between  specific  and  nonspecific  signals  in  this 
assay,  Sec62p,  under  these  conditions,  appears  to  in¬ 
teract  only  with  the  a-factor  signal  sequence.  No  in¬ 
teraction  with  any  of  the  tested  Cub  constructs  was 
detectable  with  the  functionally  inactive  Nub- 
Sec62(AC60)-Dha  (Figure  7D). 

DISCUSSION 

The  new  application  of  the  split-Ub  technique  (Johns- 
son  and  Varshavsky,  1994a,  1997)  described  in  the 
present  work  introduces  a  tool  for  the  analysis  of 
transient  (short-lived)  protein  interactions  in  living 
cells.  A  split-Ub  assay  involves  the  tagging  of  two 
(presumably)  interacting  proteins  with  the  N-  and 
C-terminal  halves  of  Ub,  Nub  and  Cub,  and  monitoring, 
in  a  variety  of  ways,  the  release  of  the  reporter  protein 
fused  to  the  C  terminus  of  Cub.  The  reporter  release, 
through  the  cleavage  by  Ub-specific  processing  pro¬ 
teases  (UBPs),  takes  place  in  the  cytosol  if  the  two 
halves  of  Ub  interact  in  vivo  to  form  a  quasi-native  Ub 
moiety  upstream  of  the  reporter  (Figure  1).  Among  the 
advantages  of  this  method  are  its  applicability  either 
in  living  cells  or  in  vitro  and  its  sensitivity  to  kinetic 
aspects  of  a  protein  interaction. 

In  the  present  work,  we  applied  the  split-Ub  tech¬ 
nique  to  the  problem  of  protein  translocation  across 
membranes.  We  showed  that  Sec62p  of  S.  cerevisiae  is 
spatially  close  to  the  signal  sequence  of  the  nascent 
a-factor  polypeptide  in  vivo.  This  proximity  is  con¬ 
fined  to  the  nascent  polypeptide  chain  immediately 
following  the  signal  sequence.  In  addition,  the  extent 
of  proximity  depends  on  the  nature  of  the  signal  se¬ 
quence.  Specifically,  Cub-containing  test  proteins  that 
bore  the  signal  sequence  of  invertase  resulted  in  a 
much  lower  Ub  reconstitution  with  Nub-Sec62p  than 
the  same  Cub-containing  proteins  bearing  the  signal 
sequence  of  a-factor.  An  inactive  derivative  of  Sec62p 
failed  to  interact  with  signal  sequences  in  the  split-Ub 
assay.  Taken  together,  these  findings  are  the  first  in 
vivo  evidence  that  S.  cerevisiae  Sec62p,  a  component  of 
the  ER  translocation  machinery,  is  a  part  of  a  signal 
sequence-binding  complex. 


In  Vivo  Proximity  between  Sec62p  and  the  Signal 
Sequence  of  a-Factor 

We  have  previously  shown  that  a  region  of  the  nascent 
polypeptide  chain  that  lies  close  to  the  signal  sequence 
of  invertase  or  the  prepro-a-factor  is  briefly  exposed  to 
the  cytosol  before  its  translocation  into  the  ER  (Johns- 
son  and  Varshavsky,  1994b).  This  feature  of  transloca¬ 
tion  enabled  us,  in  the  present  work,  to  use  the 
split-Ub  assay  for  monitoring  the  proximity  between  a 
secretory  protein  and  components  of  the  translocation 
machinery.  The  Cub  moiety  was  placed  37  residues 
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downstream  from  the  signal  sequence  of  the  a-factor 
precursor,  and  the  N  terminus  of  Sec62p  (see  INTRO¬ 
DUCTION)  was  extended  with  Nub.  Using  this  ver¬ 
sion  of  the  split-Ub  assay,  we  could  demonstrate  that 
Sec62p  is  close  to  the  nascent  chain  of  the  a-factor 
during  its  translocation.  By  moving  the  Cub  moiety 
farther  downstream  from  the  signal  sequence  of  the 
a-factor  precursor,  we  obtained  "snapshots"  of  the 
relative  proximity  between  the  nascent  polypeptide 
chain  and  Sec62p  in  vivo.  The  proximity  thus  detected 
was  considerably  reduced  once  the  spacer  sequence 
between  the  signal  sequence  of  the  a-factor  precursor 
and  CLlb  was  increased  from  37  to  65  residues  (Figures 
5B  and  7B).  The  data  strongly  suggest  that  the  access 
of  Sec62p  to  the  nascent  chain  of  a-factor  is  confined  to 
a  region  of  the  nascent  chain  that  is  very  close  to  the 
signal  sequence.  Similar  results  were  obtained  with 
the  signal  sequence  of  invertase  as  well.  This  property 
of  Sec62p  is  the  expected  feature  of  a  component  of  a 
signal  sequence  receptor. 

Our  interpretation,  supported  by  several  control  ex¬ 
periments,  is  in  agreement  with  the  results  produced 
by  cross-linking  and  binding  studies  in  cell-free  sys¬ 
tems  (Musch  et  al,  1992;  Lyman  and  Schekman,  1997; 
Matlack  et  al,  1997).  Sec62p  could  be  cross-linked  to 
the  a-factor  precursor  in  vitro,  but  only  when  ATP 
was  omitted  and  the  initiation  of  translocation  of 
a-factor  was  halted.  Upon  the  addition  of  ATP,  the 
translocation  resumed  and  cross-linking  was  no 
longer  possible  (Musch  et  al,  1992;  Lyman  and  Schek¬ 
man,  1997).  The  cross-linking  between  Sec62p  and  the 
nascent  polypeptide  chain  was  not  observed  when  the 
translocating  chain  was  halted  in  the  ER  channel 
(Musch  et  al,  1992;  Sanders  et  al ,  1992).  It  was  there¬ 
fore  assumed  that  Sec62p  is  not  a  part  of  the  channel 
and  that  it  functions  in  the  early  steps  of  substrate 
recognition  and  initiation  of  translocation.  The 
split-Ub  assay,  in  its  current  form,  depends  on  both 
halves  of  Ub  being  in  the  cell's  cytosol.  Therefore,  the 
absence  of  the  diagnostic  cleavage  (Figures  5B,  5C, 
and  7)  when  the  Cub  moiety  was  placed  farther  down¬ 
stream  from  the  signal  sequence  (see  RESULTS),  while 
consistent  with  the  absence  of  interactions  between 
Sec62p  and  the  nascent  chain  after  the  initiation  of  its 
translocation,  does  not  address  this  issue  directly.  The 
Cub  moiety  that  emerges  from  the  ribosome  after  it  has 
docked  at  the  ER  channel  is  not  accessible  to  the 
cytosolic  Nub  moiety  even  if  an  Nub-linked  protein  is 
spatially  close  to  the  translocation  pore.  This  also  ex¬ 
plains  the  inability  of  Nub-Sec61p  to  induce  the  cleav¬ 
age  of  Cub-containing  translocation  substrates  bearing 
long  spacer  sequences  between  the  signal  sequence  and 
Cub,  although  the  in  vitro  cross-linking  studies  have 
shown  Sec61p  to  be  in  constant  contact  with  the  trans¬ 
locating  polypeptide  (Mothes  et  al,  1994)  (Figure  7). 

The  proximity  between  Sec62p  (or  Sec61p)  and  a 
translocating  polypeptide  is  short  lived.  The  rapid 


transfer  of  the  Cub  moiety  into  the  lumen  of  the  ER 
was  shown  to  either  prevent  or  strongly  inhibit  its 
interaction  with  the  Nub  moiety  of  Nub-Sec62p  and 
Nub-Sec61p  (Figures  3A  and  6C).  In  these  experiments, 
the  Nub  moieties  bore  either  the  glycine  (Nug)  or  the 
alanine  (Nua)  residue  at  position  13  of  Nub.  These 
modifications  decrease  the  affinity  between  the  two 
halves  of  Ub  (see  INTRODUCTION),  thereby  making 
the  reconstitution  of  a  quasi-native  Ub  moiety  more 
dependent  on  the  stability  (half-life)  of  interactions 
between  the  proteins  linked  to  Nub  and  Cub.  These 
assays  clearly  distinguished  the  Sec62p-Sec61p  signal 
sequence  interactions  from  those  that  underlie  the  bet¬ 
ter  understood,  longer-lived  protein  complexes.  For 
example,  when  linked  to  homodimerizing  leucine  zip¬ 
pers,  the  Nua  moiety,  and  even  the  Nug  moiety,  is 
sufficient  for  reconstitution  of  the  Ub  (Jonnsson  and 
Varshavsky,  1994a). 

Is  Sec62p  Part  of  a  Signal  Sequence  Receptor? 

The  split-Ub  assay  measures  the  concentrations  of  the 
protein-coupled  Nub  and  Cub  moieties  in  the  immedi¬ 
ate  vicinity  of  each  other.  Therefore,  a  positive  result 
of  a  split-Ub  assay  signifies  a  spatial  proximity  be¬ 
tween  the  two  proteins  but  cannot,  by  itself,  prove 
their  physical  interaction  or  address  the  functional 
significance  of  this  proximity.  Nub-Sec62(AC60)Dha  is 
functionally  inactive  and  was  shown  to  be  not  close  to 
the  translocating  Mfa37-Cub-Dha  (Figures  6  and  7). 
Since  the  sequence  between  Nub  and  the  first  mem¬ 
brane-spanning  region  of  Sec62  was  retained  in  the 
C-terminally  truncated  Sec62(AC60)Dha,  the  distance 
between  Nub  and  the  ER  membrane  was,  most  prob¬ 
ably,  not  altered  relative  to  wild- type  Sec62p.  The  lack 
of  significant  cleavage  of  Mfa37-Cub-Dha  in  the  pres¬ 
ence  of  Nub-Sec62(AC60)Dha  must  therefore  result 
from  the  increased  distance  between  Sec62(AC60)p 
and  the  translocating  polypeptide  chain.  The  C-termi- 
nal  domain  of  intact  Sec62p  may  contact  other  com¬ 
ponents  of  the  translocation  complex;  alternatively,  it 
may  contribute  to  a  binding  site  for  the  signal  se¬ 
quence  or  the  nascent  chain.  These  and  related  uncer¬ 
tainties  notwithstanding,  our  results  (Figures  6  and  7) 
provide  the  first  in  vivo  evidence  to  support  the  view 
that  Sec62p  is  part  of  a  signal  sequence-binding  com¬ 
plex. 

Sec62p  Discriminates  between  Different  Signal 
Sequences 

The  split-Ub  assay  has  made  it  possible  to  show  that 
Sec62p  discriminates,  in  living  cells,  between  two  dis¬ 
tinct  signal  sequences  in  otherwise  identical  fusion 
proteins.  In  the  presence  of  Nub-Sec62p,  more  of  the 
free  Dha  reporter  protein  was  produced  in  vivo  from 
Mfa37-Cub-Dha  than  from  Suc223-Cub-Dha  (Figures  5 
and  6).  This  selectivity  is  a  property  of  Sec62p  and  not 
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a  feature  of  the  assay  used,  since  approximately  equal 
amounts  of  the  cleaved  reporter  were  produced  with 
different  signal  sequences  if  the  Nub  moiety  was 
present  as  the  Nub-Sec61p  fusion  (Figures  6  and  7). 
This  result  confirmed,  in  vivo,  that  different  targeting 
pathways  converge  at  the  Sec61 -containing  complex 
to  initiate  translocation.  The  existence  of  at  least  two 
different  targeting  pathways  to  the  translocation  pore 
was  suggested  by  Walter  and  co-workers  on  the  basis 
of  the  properties  of  yeast  mutants  that  lacked  either 
SRP  or  its  receptor  (Harm  and  Walter,  1991;  Ogg  et  al , 
1992).  One  possibility  is  that  the  targeting  via  SRP 
operates  cotranslationally,  whereas  the  targeting  via 
the  Sec62/63  complex  is  predominantly  posttransla- 
tional.  The  bifurcation  between  the  cotranslational  and 
posttranslational  targeting  is  expected  to  be  stochastic 
for  many  translocated  proteins.  Nonetheless,  certain 
signal  sequences  do  prefer  SRP,  while  some  of  the 
other  signal  sequences  are  targeted  by  the  Sec62/ 
Sec63  complex  (Ng  et  al ,  1996).  Genetic  studies  have 
shown  that  the  translocation  of  invertase  continues  in 
the  presence  of  mutations  in  either  the  SRP  or  the 
Sec62/63  complex  (Deshaies  and  Schekman,  1989; 
Ogg  et  al,  1992).  However,  the  kinetics  and  efficiency 
of  invertase  translocation  are  altered  in  the  absence  of 
SRP  (Harm  and  Walter,  1991;  Johnsson  and  Var¬ 
shavsky,  1994b). 

To  explain  the  different  efficiencies  of  cleavage  of  the 
two  signal  sequence-bearing  Cub  fusions,  we  propose 
the  following  model.  The  signal  sequence  of  invertase 
is  recognized  primarily  by  SRP  and  then  transferred  to 
the  trimeric  Sec61p  complex,  which  completes  the  pro¬ 
tein's  translocation  across  the  ER  membrane.  Under 
conditions  that  result  in  a  shortage  of  SRP  or  a  com¬ 
petition  among  different  signal  sequences,  the  Sec62p/ 
Sec63p-containing  complex,  being  a  part  of  the  alter¬ 
native  targeting  pathway,  would  recognize  an 
increasing  fraction  of  invertase.  This  would  explain 
why  specific  interactions  between  the  invertase  signal 
sequence-bearing  proteins  and  Sec62p  can  only  be 
observed  for  the  more  highly  expressed  Cub-Dha  fu¬ 
sions  (Figure  5C).  By  contrast,  the  targeting  of  proteins 
bearing  the  signal  sequence  of  the  a-factor  precursor  is 
mediated,  in  vivo ,  predominantly  by  the  Sec62p/ 
Sec63p-containing  complex.  This  would  account  for 
the  observed  close  proximity  of  these  test  proteins  to 
Sec62p  (Figures  5-7).  If  so,  the  split-Ub  technique 
makes  it  possible  to  estimate  the  flux  of  two  different 
secretory  proteins  through  the  targeting  pathways  to 
the  ER  membrane  without  the  necessity  of  deleting  or 
otherwise  inactivating  specific  components  of  the  tar¬ 
geting  complex. 

Further  Applications  of  the  Split-Ub  Technique 

The  split-Ub  sensor  should  also  be  applicable  to  other 
settings  that  involve  short-lived  protein  interactions 


that  occur  in  the  cytosol  and  are  freely  accessible  to  the 
Ub-specific  proteases.  The  advantage  of  using  this 
method  for  the  analysis  of  protein  translocation  stems, 
in  part,  from  the  fast  and  irreversible  removal  of  the 
translocated  chain  from  the  location  (cytosol)  where 
the  Nub/Cub  interaction  is  monitored.  Similar  situa¬ 
tions  are  expected  for  the  translocation  of  proteins  into 
other  organelles  such  as  the  mitochondrion,  the  nu¬ 
cleus,  and  the  peroxisome. 

The  demonstration,  in  the  present  work,  that  Ura3p 
can  serve  as  a  reporter  in  a  split-Ub  assay  (Figure  7) 
opens  the  way  to  genetic  screens  based  on  this  assay. 
For  example,  introducing  a  DNA  library  consisting  of 
random  Nub-gene  fusions  into  a  strain  expressing  a 
signal  sequence-bearing  Cub-Ura3p  protein  should  al¬ 
low  the  identification  of  genes  involved  in  targeting  or 
translocation  by  enabling  the  cells  to  form  colonies  on 
media  lacking  uracil.  This  selection  for  protein  ligands 
that  interact  transiently  in  the  vicinity  of  a  membrane 
complements  the  recent  split-Ub- based  screen  for  li¬ 
gands  that  form  relatively  stable  complexes  (Stagljar  et 
al,  1998). 
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Mutant  allele  pso7-l,  that  sensitizes  Saccharomyces  cerevisiae 
to  photoactivated  psoralen,  is  allelic  with  COX11,  encoding 
a  protein  indispensable  for  a  functional  cytochrome  c  oxidase 
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Abstract  The  yeast  gene  PS07  was  cloned  from  a  ge¬ 
nomic  library  by  complementation  of  the  pso7-l  mu¬ 
tant’s  sensitivity  phenotype  to  4-nitroquinoline-l -oxide 
(4NQO).  Sequence  analysis  revealed  that  PS07  is  allelic 
to  the  1.1  -kb  ORF  of  the  yeast  gene  COX  11  which  is 
located  on  chromosome  XVI  and  encodes  a  protein  of 
28-kDa  localized  in  the  inner  mitochondrial  membrane. 
Allelism  of  PS07/C0X1 1  was  verified  by  non-comple¬ 
mentation  of  4NQO-sensitivity  in  diploids  homo-  and 
hetero-allelic  for  the  pso7-l  and  coxl lv.TRPl  mutant 
alleles.  Sensitivity  to  4NQO  was  the  same  in  exponen¬ 
tially  growing  cells  of  the  pso7-l  mutant  and  the 
coxl  lv.TRPl  disruptant.  Allelism  of  COX11  and  PS07 
indicates  that  the  pso7  mutant’s  sensitivity  to  photoac¬ 
tivated  3-carbethoxypsoralen  and  to  4NQO  is  not 
caused  by  defective  DNA  repair,  but  rather  is  due  to  an 
altered  metabolism  of  the  pro-mutagen  4NQO  in  the 
absence  of  cytochrome  oxidase  (Cox)  in  pso7-lj 
coxl  lv.TRPl  mutants/disruptants.  Lack  of  Cox  might 
also  lead  to  a  higher  reactivity  of  the  active  oxygen 
species  produced  by  photoactivated  3-carbethoxypsor¬ 
alen.  The  metabolic  state  of  the  cells  is  important  for 
their  sensitivity  phenotype  since  the  largest  enhancement 
of  sensitivity  to  4NQO  between  wild-type  (WT)  and  the 
pso7  mutant  occurs  in  exponentially  growing  cells,  while 
cells  in  stationary  phase  or  growing  cells  in  phosphate 
buffer  have  the  same  4NQO  resistance,  irrespective  of 
their  WT/mutant  status.  Strains  containing  the pso7-l  or 
coxl  lv.TRPl  mutant  allele  were  also  sensitive  to  the 
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oxidative  stress-generating  agents  H2O2  and  paraquat. 
Mutant  pso7- 7,  as  well  as  disruptant  coxllv.TRPl , 
harboured  mitochondria  that  in  comparison  to  WT 
contained  less  than  5%  and  no  detectable  Cox  activity, 
respectively. 

Key  words  Psoralen  sensitivity  *  Cytochrome  oxidase  * 
Saccharomyces  cerevisiae  ■  Oxidative  stress 


Introduction 

The  pso  mutants  of  Saccharomyces  cerevisiae  are  sensi¬ 
tive  to  photoactivated  psoralens.  PSO  genes  have  re¬ 
cently  been  divided  into  two  groups;  those  which  seem 
directly  involved  in  DNA  repair  and  those  that  most- 
likely  are  not.  Whereas  PSOl  (allelic  to  REV 3)^  PSO 2 
(allelic  to  SNM1 ),  PS03,  PS04  (allelic  to  PRP19 ),  and 
PS05  (allelic  to  RAD16)  encode  proteins  with  apparent 
functions  in  error-free  and  error-prone  repair  (see  review 
by  Henriques  et  al.  1997)  the  genes  PS06  and  PS07 ,  by 
virtue  of  the  weak  sensitivity  phenotype  of  their  mutant 
alleles  (Querol  et  al  1994),  are  thought  not  to  represent 
genes  encoding  repair  proteins.  Sensitivity  in  these  latter 
mutants  is  mainly  for  treatments  involving  agents  that, 
amongst  other  features,  induce  oxidative  stress  (Querol 
et  al.  1994).  PS06  has  recently  been  cloned  and  found  to 
be  allelic  with  the  gene  ERG3  encoding  sterol  C-5  de- 
saturase,  an  enzyme  necessary  for  the  synthesis  of  er- 
gosterol  (Schmidt  et  al.  1999). 

For  PS07  the  only  existing  mutant  allele,  pso7~l ,  was 
shown  to  primarily  confer  a  moderate  sensitivity  to 
photoactivated  3-carbethoxypsoralen  (3CPs  +  UVA). 
Stationary  phase  cells  of  the  pso7-l  mutant  have  nearly 
wild-type  (WT)  resistance  for  a  range  of  mutagens, 
namely  UV254  nm,  8-MOP  +  UVA,  nitrogen  half  mus¬ 
tard  (HN1),  and  4NQO  (Querol  et  al.  1994).  However, 
pso7-l  mutants  do  exhibit  increased  sensitivity  to  4NQO 
when  the  pro-mutagen  is  added  to  exponentially  grow¬ 
ing  cells.  This  feature  was  used  in  our  strategy  to  mo- 
lecularly  clone  PS07  via  the  isolation  of  4NQO-resistant 
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Abstract  The  N-end  rule  pathway  is  a  ubiquitin-de- 
pendent  proteolytic  system  whose  targets  include 
proteins  bearing  destabilizing  N-terminal  residues.  We 
carried  out  a  synthetic  lethal  screen  for  Saccharomyces 
cerevisiae  mutants  that  require  the  N-end  rule  pathway 
for  cell  viability.  A  mutant  thus  identified,  termed  sln2, 
could  not  grow  in  the  absence  of  Ubrlp,  the  recognition 
component  of  the  N-end  rule  pathway,  which  was  not 
essential  for  viability  of  the  parental  strain  under  the 
same  conditions.  Further  analysis  showed  that  inviabil¬ 
ity  of  sln2  ubrlA  cells  could  be  rescued  either  by  the 
HIS3  gene  (which  was  absent  from  the  parental  strain) 
or  by  a  high  concentration  of  histidine  in  the  medium. 
This  defect  in  histidine  uptake,  exhibited  by  the  sln2 
mutant  in  the  absence  but  not  in  the  presence  of  Ubrlp, 
was  traced  to  the  gene  HIP l,  which  encodes  the  histidine 
transporter.  HIP1  was  underexpressed  in  sln2  ubrlA 
cells,  in  comparison  to  either  sln2  UBR1  or  SLN2  ubrlA 
cells.  Yet  another  property  of  the  sln2  mutant  was  its 
inviability  at  37  °C,  which  could  not  be  rescued  by  either 
UBR1  or  HISS.  This  feature  of  sln2  allowed  the  cloning 
of  SLN2,  which  was  found  to  be  a  gene  called  CIN8, 
encoding  a  kinesin-like  protein.  Thus,  either  the  N-end 
rule  pathway  or  Cin8p  must  be  present  for  the  viability- 
sustaining  rate  of  histidine  import  in  S.  cerevisiae  aux¬ 
otrophic  for  histidine.  We  consider  possible  mechanisms 
of  this  previously  unsuspected  link  between  kinesins, 
ubiquitin-dependent  proteolysis,  and  the  import  of  his¬ 
tidine. 

Key  words  Ubiquitin  •  Proteolysis  •  Kinesin  •  Histidine 
import  •  CIN8  •  H1P1  ■  UBR1 
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Introduction 

A  number  of  regulatory  circuits  involve  metabolically 
unstable  proteins.  A  short  in  vivo  half-life  of  a  regulator 
provides  a  way  to  generate  its  spatial  gradients  and 
allows  for  rapid  adjustments  of  its  concentration,  or 
subunit  composition,  through  changes  in  the  rate  of 
regulator’s  synthesis  or  degradation.  Short  in  vivo  half- 
lives  are  also  characteristic  of  damaged  or  otherwise 
abnormal  proteins  (Hochstrasser  1996;  Varshavsky 
1997;  Peters  et  al.  1998;  Scheffner  et  al.  1998).  Features 
of  proteins  that  confer  metabolic  instability  are  called 
degradation  signals,  or  degrons.  The  essential  compo¬ 
nent  of  one  degradation  signal,  the  first  to  be  identified, 
is  a  destabilizing  N-terminal  residue  of  a  protein 
(Bachmair  et  al.  1986;  Varshavsky  1996).  The  set  of 
amino-acid  residues  which  are  destabilizing  in  a  given 
cell  yields  a  rule,  called  the  N-end  rule,  which  relates  the 
in  vivo  half-life  of  a  protein  to  the  identity  of  its  N- 
terminal  residue.  Similar  but  distinct  N-end  rule  path¬ 
ways  are  present  in  all  organisms  examined,  from 
mammals  and  plants  to  fungi  and  bacteria  (Bachmair 
and  Varshavsky  1989;  Gonda  et  al.  1989;  Tobias  et  al. 
1991;  Varshavsky  1996;  Worley  et  al.  1998). 

In  eukaryotes,  the  degradation  signals  (N-degrons) 
recognized  by  the  N-end  rule  pathway  comprise  two 
determinants:  a  destabilizing  N-terminal  residue  and  an 
internal  lysine  or  lysines  (Bachmair  and  Varshavsky 
1989;  Hill  et  al.  1993;  Dohmen  et  al.  1994).  The  Lys 
residue  is  the  site  of  formation  of  a  multiubiquitin  chain 
(Chau  et  al.  1989).  The  N-end  rule  pathway  is  thus  one 
pathway  of  the  ubiquitin  system.  Ubiquitin  is  a  76-res¬ 
idue  protein  whose  covalent  conjugation  to  other  pro¬ 
teins  plays  a  role  in  a  multitude  of  processes,  including 
cell  growth,  division,  differentiation,  and  responses  to 
stress  (Johnson  et  al.  1995;  Hochstrasser  1996;  Hershko 
1997;  Pickart  1997;  Varshavsky  1997;  Scheffner  et  al. 
1998).  In  most  of  these  processes,  Ub  acts  through 
routes  that  involve  the  degradation  of  ubiquitin-protein 
conjugates  by  the  26S  proteasome,  an  ATP-dependent 
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multisubunit  protease  (Rechsteiner  et  al.  1993;  Coux 
et  al.  1996;  Baumeister  et  al.  1998;  Glickman,  et  al. 
1998). 

The  N-end  rule  is  organized  hierarchically.  In  the 
yeast  Saccharomyces  cerevisiae ,  Asn  and  Gin  are  tertiary 
destabilizing  N-terminal  residues  in  that  they  function 
through  their  conversion  into  the  secondary  destabilizing 
N-terminal  residues  Asp  and  Glu.  This  conversion  is 
mediated  by  the  A7/1 /-encoded  N-terminal  amidase 
(Nt-amidase)  (Baker  and  Varshavsky  1995).  The  desta¬ 
bilizing  activity  of  N-terminal  Asp  and  Glu  requires 
their  conjugation,  by  the  ,4  T£7 -encoded  Arg-tRNA- 
protein  transferase  (R-transferase),  to  Arg,  one  of  the 
primary  destabilizing  residues  (Balzi  et  al.  1990).  The 
primary  destabilizing  N-terminal  residues  are  bound 
directly  by  the  UBRJ -encoded  N-recognin  (also  called 
E3),  the  recognition  component  of  the  N-end  rule 
pathway  (Bartel  et  al.  1990).  In  S.  cerevisiae ,  N-recognin 
is  a  225  kDa  protein  that  binds  to  potential  N-end  rule 
substrates  through  their  primary  destabilizing  N-termi- 
nal  residues  -  Phe,  Leu,  Trp,  Tyr,  lie,  Arg,  Lys,  and  His 
(Varshavsky  1996).  The  £/6r/-encoded  mammalian  N- 
recognin  (E3ot)  has  also  been  characterized  (Hershko 
1991;  Kwon  et  al.  1998).  Ubrlp  has  at  least  two  sub¬ 
strate-binding  sites.  The  type-1  site  is  specific  for  the 
basic  N-terminal  residues  Arg,  Lys  and  His.  The  type-2 
site  is  specific  for  the  bulky  hydrophobic  N-terminal 
residues  Phe,  Leu,  Trp,  Tyr,  and  lie  (Varshavsky  1996). 
Both  yeast  and  mammalian  Ubrlp  have  also  been 
shown  to  recognize  a  set  of  internal  degrons  that  remain 
to  be  characterized  (Gonen  et  al.  1991;  Madura  and 
Varshavsky  1994;  Byrd  et  al.  1998). 

The  known  functions  of  the  N-end  rule  pathway  in¬ 
clude  the  control  of  peptide  import  in  S.  cerevisiae , 
through  the  degradation  of  Cup9p,  a  transcriptional 
repressor  of  PTR2  which  encodes  the  peptide  trans¬ 
porter  (Alagramam  et  al.  1995;  Byrd  et  al.  1998);  a  role 
in  regulating  the  Slnlp-dependent  phosphorylation  cas¬ 
cade  that  mediates  osmoregulation  in  S.  cerevisiae  (Ota 
and  Varshavsky  1993);  the  degradation  of  Gpalp,  a  Ga 
protein  of  S.  cerevisiae  (Madura  1994,  #60);  and  the 
conditional  degradation  of  alphaviral  RNA  polymerase 
in  virus-infected  metazoan  cells  (deGroot  et  al.  1991; 
Varshavsky  et  al.  1998).  Physiological  N-end  rule  sub¬ 
strates  were  also  identified  among  the  proteins  secreted 
into  the  mammalian  cell’s  cytosol  by  intracellular  para¬ 
sites  such  as  the  bacterium  Listeria  monocytogenes  (Sijts 
et  al.  1997).  Inhibition  of  the  N-end  rule  pathway  was 
reported  to  interfere  with  mammalian  cell  differentiation 
(Hondermarck  et  al.  1992),  and  to  delay  limb  regener¬ 
ation  in  amphibians  (Taban  et  al.  1996).  In  mammals, 
the  N-end  rule  pathway  is  also  likely  to  play  a  role  in 
catabolic  states  that  result  in  muscle  atrophy  (Solomon 
et  al.  1998  a,b). 

While  the  absence  of  the  N-end  rule  pathway 
(specifically,  the  absence  of  Ubrlp)  from  S.  cerevisiae  is 
lethal  under  conditions  where  the  import  of  peptides  is 
essential  for  cell  survival  (Alagramam  et  al.  1995;  Byrd 
et  al.  1998),  ubrlA  cells  are  viable  and  only  slightly 


growth-retarded  on  standard  media  (Bartel  et  al.  1990). 
Synthetic  lethal  screens  can  often  identify  pathways 
whose  essential  functions  overlap  (or  interact)  with  the 
function  of  a  system  of  interest  (Singer-Kruger  and 
Ferro-Novick  1997;  Ho  et  al.  1998;  Merrill  and  Holm 
1998).  An  earlier  screen  for  S.  cerevisiae  mutants  that 
require  Ubrlp  for  viability  has  identified  Sin  Ip,  which 
contains  a  sensor  kinase  and  response  regulator  domains 
analogous  to  those  of  the  prokaryotic  two-component 
regulators  (Ota  and  Varshavsky  1993).  Slnlp  functions 
in  the  HOG  signaling  pathway,  which  regulates  the 
response  of  cells  to  .changes  in  the  media  osmolarity 
(Posas  et  al.  1996).  slnl  mutants  are  growth-impaired  in 
presence  of  Ubrlp  and  inviable  in  its  absence  (Ota  and 
Varshavsky  1993).  The  identity  of  a  relevant  Ubrlp 
substrate  in  the  HOG  pathway  is  unknown. 

In  the  present  work,  we  carried  out  a  modified  syn¬ 
thetic  lethal  screen  for  S.  cerevisiae  mutants  that  require 
Ubrlp  for  viability.  Our  findings  uncovered  a  previously 
unsuspected  connection  between  kinesins,  the  N-end 
rule  pathway,  and  the  import  of  histidine. 


Materials  and  methods 

Yeast  strains,  media,  and  genetic  procedures.  S.  cerevisiae  strains 
used  in  this  study  are  listed  in  Table  1.  Yeast  were  grown  in  rich 
(YPD)  medium,  or  in  synthetic  media  containing  0.67%  yeast  ni¬ 
trogen  base  without  amino  acids  (Difco),  auxotrophic  nutrients, 
including  adenine,  arginine,  histidine,  leucine,  lysine,  methionine, 
phenylalanine,  threonine,  tryptophan  and  uracil,  and  either  2% 
dextrose  (glucose)  (SD  medium)  or  2%  galactose  (SG  medium) 
(Sherman  et  al.  1986).  In  some  experiments,  the  concentration  of 
histidine  in  SD  medium  was  increased  10-fold,  to  0.2  mg/ml.  Un¬ 
less  stated  otherwise,  cells  were  incubated  at  30  °C.  Yeast  mating, 
sporulation,  and  tetrad  dissection  were  performed  as  described 
(Sherman  et  al.  1986).  The  mating-type  switch  was  carried  out 
through  transformation  with  the  plasmid  pGAL-HO  (Herskowitz 
and  Jensen  1991).  S.  cerevisiae  were  transformed  as  described  by 
Chen  et  al.  (1992).  The  Escherichia  coli  strain  used  was  DH5a 
(Ausubel  et  al.  1996). 

Synthetic  lethal  screen.  The  haploid  P gali-UBRI  strain  JD54,  in 
which  the  UBRJ  gene  is  expressed  from  the  P GALl  promoter 
(Ghislain  et  al.  1996),  was  mutagenized  with  ethyl  methane- 
sulfonate  (EMS)  (Sigma,  St.  Louis,  Mo.)  to  approximately  7% 
survival  upon  plating  to  SG  medium  at  30  °C.  The  colonies  on  SG 
plates  were  replica-plated  onto  SD  plates.  The  colonies  that  grew 
on  SG  but  not  on  SD  media  were  re-tested  for  the  absence  of 
growth  on  SD  plates.  The  candidate  isolates  were  transformed  with 
plasmids  that  expressed  UBR!  either  from  its  natural  promoter 
[pUBRl  (Bartel  et  al.  1990)]  or  the  much  stronger  P ADHI  promoter, 
and  tested  for  survival  on  SD  plates.  One  mutant,  termed  sln2 
(AVY200),  was  rescued  by  UBRJ  on  SD  plates,  and  was  selected 
for  further  study. 

Isolation  of  HIS3  as  a  suppressor  of  sln2.  The  sln2  mutant  was 
transformed  with  a  yeast  genomic  DNA  library  (American  Type 
Culture  Collection,  #77164)  in  the  TRPl ,  CETVd-based  vector 
pRS200  (Sikorski  and  Hieter  1989).  Cells  were  plated  on  SD(-Trp) 
plates  and  incubated  at  30  °C  for  2-3  days.  Out  of  approximately 
4  x  104  transformants,  25  could  grow  on  SD(-Trp).  Library  plas¬ 
mids  were  rescued  from  these  transformants  and  re- transformed 
into  sln2  cells.  Eight  of  these  plasmids  conferred  viability  on  sln2 
cells  after  re- transformation  and  selection  on  SD(-Trp).  Restriction 
analyses  showed  that  these  plasmids  were  either  identical  or  had 
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Table  1  5.  cerevisiae  strains 
used  in  this  study 


Strain  ,  . 

'  Genotype 

References 

JD52 

MATa  uraS-52  lys2-801  trpl-A63  his3-A200 
leu 2- 3.  112 

Johnson  et  al.  (1995) 

JD54 

M A  /a  ura3-52  lys2-801  trpl-A63  his3-A200 
leu2-3,  112  GAL1::UBR1 

Ghislain  et  al.  (1996) 

AVY199 

M  A  Tct  ura3-52  lys2-801  trpl-A63  his3-A200 
Ieu2-3,U2  GAL1  ::UBR1 

Derivative  of  JD54 

AVY200a 

M A  Ta  sln2  ura3-52  Ivs2-801  trpl-A63  his3-A200 
leu2-3,  112  GALlr.UBRl 

Derivative  of  JD54 

AVY201 

MATa  ura3-52  lys2-801  trpl-A63  his3-A200 
leu2-3.  112  GALlr.UBRl  hiplA-3::URA3 

Derivative  of  JD54 

AVY202 

MATn  ura3-52  lys2-801  trpl-A63  his3-A200 
leu2-3,  112  GALL: UBRl  cin8A-6::URA3 

Derivative  of  JD54 

AVY203 

MATa  ura3-52  lys2-801  trpl-A63  his3-A200 
leu2-3,  112  cin8A-I0::URA3 

Derivative  of  JD52 

AVY204 

MATa  sln2  ura3-52  lys2-801  trpl-A63  his3-A200 
leu2-3,  112  GALlr.UBRl  cup9A-l::LEU2 

Derivative  of  AVY200 

a  Mutant  (sln2)  that  requires  UBR1  for  viability 


large  fragments  in  common.  Truncation  and  sequencing  analyses 
localized  the  complementing  activity  to  a  1 .7-kb  BamHl  fragment 
containing  the  HIS3  gene.  This  finding  was  independently 
confirmed  by  showing  that  the  HIS3  open  reading  frame  (ORF), 
expressed  from  the  ?met25  promoter  (Mumberg  et  al.  1994),  was 
also  able  to  complement  the  sln2  mutant  on  SD  plates. 

Histidine  uptake  assay.  This  assay  was  carried  out  as  described 
(Grauslund  et  al.  1995),  with  minor  modifications.  Briefly,  the 
strains  to  be  assayed  were  grown  to  A600  of  about  0.6  in  SD(-His) 
medium  at  30°C.  Cells  were  harvested  from  5-ml  cultures  and 
resuspended  in  0.4-0.5  ml  of  SD(-His)  at  an  identical  cell  density, 
adjusted  using  a  hemocytometer.  For  each  sample,  0.1  ml  of  sus¬ 
pension  was  added  to  0.1  ml  of  SD(-His)  medium  containing  8  gM 
of  L-[,4C]histidine  (Amersham,  Arlington  Heights,  Ill.;  specific 
radioactivity  0.3  Ci/mmol),  and  incubated  at  30°C  for  2  min. 
Samples  were  removed,  filtered  through  a  #30  glass-fiber  filter 
(Schleicher  and  Schuell),  washed  with  10  ml  of  ice-cold  w'ater,  and 
air-dried.  14C  radioactivity  was  determined  using  a  liquid  scintil¬ 
lation  counter.  Since  high  concentrations  of  histidine  in  the  medi¬ 
um  would  have  interfered  with  the  assay,  HIS3  was  introduced  (as 
the  pRS313  plasmid)  into  all  his3A  strains  to  be  tested,  making 
them  prototrophic  for  histidine. 

Northern  hybridization.  S.  cerevisiae  strains  were  grown  at  30°C  to 
A600  of  about  1.  RNA  wras  isolated  as  described  (Schmitt  et  al. 
1990)  and  fractionated  by  electrophoresis  in  formaldehyde-con¬ 
taining  agarose  gels  (Ausubel  et  al.  1996).  Briefly,  approximately 
40  pg  of  RNA  per  lane  was  fractionated  in  a  1.2%  agarose  gel 
containing  0.66  M  formaldehyde,  1  x  MOPS  buffer  and  0.1  pg/ml 
of  ethidium  bromide  (Ausubel  et  al.  1996).  Electrophoresis  was 
performed  at  5  V/cm  in  1  x  MOPS  buffer.  Fractionated  RNA  was 
transferred  onto  a  GeneScreen  Plus  membrane  (DuPont  NEN, 
Boston,  Mass.)  according  fo  the  manufacturer’s  instructions.  DNA 
probes  were  labeled  with  [oe-32P]dCTP  using  a  DNA  labeling  kit 
(Pharmacia  Biotech,  Piscataway,  N.J.).  Hybridization  was  carried 
out  using  a  NorthernMax  kit  (Ambion,  Austin,  Tex.).  Hybridiza¬ 
tion  patterns  were  detected  by  autoradiography  and  quantitated 
using  Phosphorlmager  (Molecular  Dynamics). 

hiplA  mutant.  This  mutant  was  produced  from  JD54  (Table  1) 
using  a  direct  deletion  technique  described  previously  (Baudin  et  al. 
1993).  Briefly,  two  oligodeoxynucleotides  of  66  nt  were  synthesized, 
attagtacacaaatacagatatacaacctcaacatcaaaaatgtcgaaagctacat- 
ataaggaacg  and  gatatggaaattccgctgtatagctcatctctttcccttcattagtttt- 
gctggcctcatcttctc.  One  oligonucleotide  comprised  40  nt  of  a 
sequence  derived  from  the  5'  untranslated  region  (UTR)  of  the 
HIP1  gene,  followed  by  26  nt  of  a  sequence  derived  from  the  UR  A3 
ORF,  beginning  from  its  start  codon.  The  second  oligonucleotide 


comprised  40  nt  of  a  sequence  derived  from  the  3'  UTR  of  HIP l , 
followed  by  26  nt  of  a  sequence  derived  from  the  3'-proximal 
region  of  the  UR  A3  ORF,  including  its  stop  codon.  The  two 
oligonucleotides  were  used  as  PCR  primers  to  amplify  URA3  from 
the  pRS306  vector  (Sikorski  and  Hieter  1989).  The  resulting 
targeting  fragment  contained  URA3  flanked  on  either  side  by  40-bp 
sequences  derived  from,  respectively,  the  5'  and  3'  UTRs  of  HIP I. 
JD54  cells  (Table  1)  were  transformed  with  the  targeting  fragment, 
and  selected  for  Ura+  transformants  at  30°C  on  SD(-Ura)  plates 
containing  0.2  mg/ml  of  histidine.  In  a  typical  experiment, 
approximately  50%  of  Ura+  transformants  were  the  expected 
hip  I A  mutants,  as  verified  by  PCR,  and  also  by  checking  for  the 
phenotype  of  dependence  on  high  concentrations  of  histidine  in  the 
medium. 

Isolation  of  HIP1.  A  gap-repair  method  was  used  to  rescue  HIP1 
from  both  wild-type  (JD54)  and  sln2  (AVY200)  strains  (Rothstein 
1991).  Two  fragments  (the  left  and  right  arms)  flanking  the  gap 
were  amplified  by  PCR  from  either  JD54  or  sln2  genomic  DNA. 
The  left  arm  was  a  693-bp  Xhol-Hindlll  fragment  corresponding  to 
the  5'  UTR  of  HIP  I  from  -976  bp  to  -284  bp.  The  right  arm  was  a 
1044-bp  Hindlll-Xbal  fragment,  corresponding  to  the  3'  UTR  of 
HIP 1 ,  470  bp  downstream  from  the  stop  codon  of  HIP  I.  The  left 
and  right  arms  were  inserted  into  the  Xhol  and  Xbal  sites  of 
pRS315,  a  LEU2 ,  C£jV<5-based  vector  (Sikorski  and  Hieter  1989). 
The  resulting  construct  was  linearized  with  Hindlll ,  and  trans¬ 
formed  into  wild-type  and  sln2  cells  (see  Fig.  3  A).  Plasmids  were 
isolated  from  the  Leu+  transformants.  Gap-filled  plasmids 
(pRS315HIPl)  were  identified  by  restriction  analysis,  and  also  by 
sequencing  the  relevant  junctional  regions.  The  primers  for 
producing  the  left-arm  fragment  were  ggggctgaaacacaagaattcataatg 
and  gcacctacttcaactccttagcag.  The  initial  PCR  product  was 
digested  with  Xhol  and  Hindlll,  yielding  the  left-arm  DNA  frag¬ 
ment.  The  primers  for  producing  the  right  arm  were  ctactacaga- 
tattacatgtggcg  and  ggtttcggtagaatcggtagattg.  The  initial  PCR 
product  was  digested  with  Hindlll  and  Xbal,  yielding  the  right 
arm.  All  PCR  products  were  verified  by  DNA  sequencing. 

Construction  of  the  sln2  cup9A  mutant.  The  previously  described 
cup9::LEU2  disruption  allele  (Byrd  et  al.  1998)  was  used  to  replace 
the  CUP9  allele  in  sln2  cells  through  homologous  recombination 
(Rothstein  1991),  yielding  the  strain  AVY204  (Table  1).  The  ab¬ 
sence  of  Cup9p  was  verified  by  testing  for  the  ability  of  AVY204 
cells  to  import  dipeptides  in  the  absence  of  UBRl  (on  SD  plates) 
(Byrd  et  al.  1998),  and  also  by  PCR  analysis. 

Isolation  of  the  SLN2  (CIN8)  gene.  The  S.  cerevisiae  genomic 
DNA  library  used  for  the  isolation  of  HIS3  (see  above)  was 
transformed  into  the  sln2  strain  (AVY200)  (Table  1).  Cells  were 
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plated  on  SD(-Trp)  plates  containing  histidine  at  0.2  mg/ml,  and 
incubated  at  30°C  for  8  h.  The  temperature  was  then  raised  to 
37°C,  until  the  colonies  formed  (about  3  days).  Out  of  approxi¬ 
mately  6  x  104  transformants  at  30°C,  15  could  grow  on  SD(-Trp, 
+  0.2  mg/ml  His)  at  37°C.  Library  derived  plasmids  were  rescued 
from  these  15  transformants  and  re-transformed  into  sln2  cells, 
with  selection  on  SD(-Trp)  plates  at  37°C.  Three  plasmids 
conferred  viability  on  sln2  cells  after  re-transformation.  Restriction 
mapping  showed  the  inserts  of  these  plasmids  to  be  identical. 
Truncation  and  sequence  analyses  confirmed  that  the  comple¬ 
menting  activity  was  confined  to  the  CIN8  locus. 


rather  than  on  the  nature  of  carbon  source  (dextrose 
versus  galactose),  the  putative  mutants  were  trans¬ 
formed  with  a  low-copy  plasmid  pUBRl  expressing 
UBR1  from  its  natural  promoter,  and  plated  on  SD 
plates.  Only  one  isolate,  termed  sln2  (strain  AVY200; 
Table  1),  formed  colonies  on  SD  plates  in  the  presence 
of  pUBRl  but  not  in  the  presence  of  the  control  vector 
(Fig.  1  A-C).  The  inviability  of  sln2  cells  on  SD  plates 


Isolation  of  the  CIN8 *  allele  from  sln2  cells.  A  gap-repair  method 
(Rothstein  1991)  described  above  for  HI  PI  was  used  to  isolate  the 
mutant  allele  of  CIN8  (denoted  as  CIN8*)  from  sln2  cells.  A  4973- 
bp  EcoK\-BamW\  fragment  containing  wild-type  CIN8 ,  including 
its  5'  (464  bp)  and  3'  (1353  bp)  untranslated  regions  (UTRs),  was 
subcloned  into  the  TRP1- based  vector  pRS314  (Sikorski  and 
Hieter  1989).  The  resulting  pRS314CIN8  was  gapped  by  digestion 
with  Pad  and  Bse RI,  and  transformed  into  the  sln2  strain 
(AVY200).  Plasmids  were  isolated  from  the  Trp+  transformants. 
The  gap-filled  plasmids  (pRS314CIN8*)  were  verified  by  restriction 
analysis  and  sequencing  of  the  relevant  junctional  regions. 

Construction  of  null  cin8  mutants.  An  approximately  2.5-kb  Sphl- 
AatU  DNA  fragment  of  pRS314CIN8  was  replaced  by  a  PCR- 
produced,  approximately  l.l-kb,  URA3-containing  Sphl-AatU 
fragment  of  YIp5,  yielding  pRS314cin8::URA3.  A  SaJA-BatnH\ 
fragment  of  pRS314cin8::URA3  containing  the  cin8:: URA3  dele¬ 
tion/disruption  allele  was  used  to  replace,  through  homologous 
recombination,  the  CIN8  gene  in  JD52,  a  wild-type  (Ubrl  ’ )  strain, 
and  in  JD54,  a  PGAL1::UBRI  strain.  The  resulting  cin8A  mutants 
AVY203  and  AVY202,  respectively  (Table  1),  lacked  codons  169— 
1015  of  the  CIN8  ORF.  The  structure  of  the  cin8A  allele,  and  the 
temperature  sensitivity  of  cinSA  mutants  were  confirmed, 
respectively,  using  PCR  and  growth  tests  at  37°C.  The  primers  for 
producing  the  UR/O-containing  DNA  fragment  were  cattG- 
CATGcgtccgtggaattctcatgtttgac  and  agtgGACGTCtaccaggtgcct- 
gactgcgttagc  (uppercase  letters  denote  the  Sphl  and  Aatli  sites, 
respectively). 


Results 

Isolation  of  the  sln2  mutant 

The  earlier  screen  for  S.  cerevisiae  mutants  requiring  the 
N-end  rule  pathway  (specifically,  Ubrlp)  for  viability 
(Ota  and  Varshavsky  1993)  was  far  from  exhaustive,  and 
in  addition  utilized  wild-type  levels  of  Ubrlp  expression. 
Since  the  activity  of  the  N-end  rule  pathway  is  higher  in 
cells  overproducing  Ubrlp  (Bartel  et  al.  1990;  Madura 
and  Varshavsky  J994),  the  present  screen  was  carried 
out  using  the  strain  JD54  (Ghislain  et  al.  1996),  in  which 
UBRI  was  expressed  from  the  galactose-inducible, 
dextrose-repressible  P  gali  promoter  (Table  1). 
Mutagenized  JD54  cells  were  plated  onto  SG  (galactose- 
containing)  plates,  and  the  colonies  formed  were  replica- 
plated  onto  SD  (dextrose-containing)  plates,  where  the 
N-end  rule  pathway  was  inactive,  owing  to  the  absence 
of  Ubrlp  expression.  Cells  that  yielded  colonies  on  SG 
but  not  on  SD  plates  were  putative  sin  (synthetic  /ethal 
of  V-end  rule)  mutants.  Four  such  isolates  were  identi¬ 
fied  among  approximately  3.5  x  105  colonies  screened. 

To  verify  that  the  conditional  viability  of  these 
isolates  was  in  fact  dependent  on  the  presence  of  Ubrlp, 
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Fig.  1  A-D  A  S.  cerevisiae  mutant  whose  viability  requires  the  N-end 
rule  pathway.  (A  and  B)  the  AVY200  (sln2  P GAL-UBR1)  strain  grows 
on  SG  but  not  on  SD  plates  (see  Results).  (C)  evidence  that  the 
viability  of  sln2  mutant  requires  Ubrlp  rather  than  galactose. 
AVY200  (sln2  PGAL-UBR1)  cells  were  streaked  on  SD  plates  after 
having  been  transformed  either  whh  pUBRl,  a  low-copy  plasmid 
expressing  UBRI  from  its  natural  promoter  or  with  pADH-UBRl, 
another  C£7V-based  plasmid  expressing  UBRI  from  the  constitutive 
and  strong  PAivn  promoter,  or  with  vector  alone.  (D)  sln2  cells  are  ts 
for  growth.  The  parental  JD54  {PGal~UBR1)  strain  and  the  AVY200 
(sln2  P gal-UBRI)  strain  (Table  1)  were  streaked  on  SG  plates  and 
incubated  at  37  °C 
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could  be  rescued  by  UBRl  expressed  at  either  normal 
(pUBRl)  or  much  higher  levels  (pRB208,  which 
expressed  UBRl  from  the  VADH}  promoter)  (Fig.  1  C). 


Histidine  import  defect  of  sln2  cells 
in  the  absence  of  Ubrlp 

Crossing  AVY200  (sln2)  to  the  strain  AVY199  (congenic 
to  JD54  but  of  the  opposite  mating  type)  (Table  1)  in¬ 
dicated  that  the  sln2  mutation  was  recessive  (data  not 
shown).  We  used  a  library  of  S .  cerevisiae  genomic  DNA 
fragments  in  a  low-copy  vector  to  transform  AVY200 
cells,  screening  for  genes  that  could  rescue  the  inviability 
of  sin 2  on  SD  plates  (in  the  absence  of  Ubrlp).  Sur¬ 
prisingly,  HIS3,  a  gene  of  the  histidine  biosynthesis 
pathway,  was  found  to  be  a  strong  suppressor  of  the 
lethality  of  AVY200  (sln2)  on  SD  plates  (Fig.  2  A). 
Since  AVY200  is  a  his  3  A  strain  (Table  1),  HI  S3  is,  by 
definition,  an  extragenic  suppressor  of  the  conditionally 
lethal  sln2  phenotype,  i.e.,  HIS3  is  distinct  from  SLN2. 

The  original  JD54  {his 3 A)  strain,  being  a  histidine 
auxotroph,  required  the  presence  of  histidine  in  the 
medium.  In  the  presence  of  HIS3 ,  the  JD54-derived 
AVY200  (sln2)  mutant  would  be  expected  to  become 
prototrophic  for  histidine.  Could  the  effect  of  the  sln2 
mutation  be  an  impairment  of  the  histidine  uptake  in  the 
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Fig.  2  Inviability  of  the  AVY200  {sln2  PGAL-UBR1)  strain  on  SD 
plates  can  be  rescued  either  by  the  HIS3  gene  (panel  A)  or  by  high 
(>75  pg/ml)  concentration  of  histidine  in  the  medium  (panel  B).  See 
Results  for  details 


absence  of  the  N-end  rule  pathway  (specifically,  in  the 
absence  of  Ubrlp)?  This  interpretation  could  account 
for  the  suppression  of  inviability  of  AVY200  cells  on  SD 
plates  by  the  HIS3  gene,  inasmuch  as  the  resulting  strain 
would  be  a  histidine  prototroph.  We  first  asked  whether 
increasing  the  concentration  of  histidine  in  the  growth 
medium  would  rescue  AVY200  (sln2)  cells  on  SD  plates. 
[It  has  previously  been  showm  that  cells  defective  in 
histidine  uptake  could  grow  on  a  medium  containing 
sufficiently  high  concentrations  of  histidine  (Tanaka  and 
Fink  1985).]  Indeed,  AVY200  (sln2  P gal-UBRI)  cells 
were  found  to  grow  on  SD  plates  containing  histidine  at 
200  pg/ml,  but  could  not  grow  at  the  standard  histidine 
concentration  of  20  |ig/ml  (Fig.  2  B). 

To  examine  the  histidine  import  of  AVY200  (sln2) 
cells  in  a  different  way,  we  compared  the  uptake  of 
L-[14C]histidine  into  sln2  and  other  yeast  strains.  Since 
high  concentrations  of  histidine  in  the  medium  would 
have  interfered  with  the  assay,  we  introduced  the  HIS3 
gene  into  all  four  of  the  his 3 A  strains  to  be  tested, 
making  them  prototrophic  for  histidine.  As  shown  in 
Fig.  3,  in  the  absence  of  Ubrlp  the  relative  rate  of 
histidine  uptake  in  AVY200  (sin 2  P GAL-UBR1)  cells  was 
about  35%  of  the  rate  in  the  parental  JD54  ( SLN2 
^gal-UBRI)  cells.  In  the  presence  of  Ubrlp  (at  its 
wildtype  concentration)  the  rate  of  histidine  uptake  in 
sln2  cells  increased  to  about  75%  of  that  in  SLN2  cells 
(Fig.  3),  thus  accounting,  at  least  in  part,  for  the 
observed  growth  of  sln2  cells  on  SG  plates  (Fig.  1  B). 
The  incomplete  recovery  of  histidine  uptake  in  Ubrlp- 
expressing  AVY200  (sln2)  cells  was  consistent  with  their 


SLN2  s\n2  hiplA  sin2(UBR1) 

Fig.  3  sln2  cells  are  impaired  in  the  import  of  histidine.  SLN2  is  the 
parental  JD54  strain  (PG/1L- UBRl)  (Table  1).  hiplA  is  a  HIP /-lacking 
derivative  of  JD54.  sln2  (pUBRl)  is  the  AVY200  {sln2  P gal-UBRI) 
strain  carrying  pUBRl.  The  uptake  of  L-[14C]histidine  was  measured 
as  described  in  Materials  and  methods.  The  results  are  expressed  as 
the  percent  of  uptake  by  SLN2  cells,  and  represent  the  means  from 
three  independent  experiments.  Standard  deviations  are  shown  above 
the  bars 
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detectably  slower  growth  on  SG  plates  in  comparison  to 
JD54  ( SLN2 )  cells  (Fig.  1  A,B).  Note  that  the  decreased 
(about  35%  of  normal)  rate  of  14C-histidine  import  in 
sln2  cells  was  still  considerably  higher  than  the  one  in 
hiplA  cells,  which  lack  the  histidine-specific  transporter 
(Fig.  3).  This  observation  is  in  agreement  with  the  fact 
that  the  minimal  concentration  of  histidine  that, allowed 
the  growth  of  sln2  cells  on  SD  plates  (75  pg/ml)  was 
much  lower  than  the  concentration  of  histidine  (200  pg/ 
ml)  required  for  the  comparable  growth  rate  of  hiplA 
cells  (data  not  shown). 


Down-regulation  of  HIP1  in  the  sln2  ubrlA  mutant 
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In  S .  cerevisiae ,  the  import  of  histidine  across  the  plasma 
membrane  is  carried  out  largely,  if  not  exclusively,  by 
two  transporters  (permeases),  Hip  Ip  and  Gap  Ip  (Tan¬ 
aka  and  Fink  1985;  Jauniaux  and  Grenson  1990).  Gaplp 
is  a  general  amino-acid  transporter  whose  activity  is 
repressed  by  rich  nitrogen  sources  such  as  NH4  in  SD  or 
SG  media  (Jauniaux  and  Grenson  1990;  Hein  et  al. 
1995).  Consequently,  the  histidine-specific  permease 
Hiplp  is  the  main  importer  of  histidine  under  these 
conditions.  The  reduced  uptake  of  histidine  by  AVY200 
(sln2)  cells  in  the  absence  of  Ubrlp  (in  the  absence  of  the 
N-end  rule  pathway)  could  be  caused  by  a  defective 
HIP l  allele  in  sln2  cells  or  by  a  reduced  expression  of  the 
otherwise  wild-type  HIPL 

To  address  the  first  possibility,  we  used  the  method 
of  gap  repair  (Fig.  4  A)  (see  Materials  and  methods)  to 
isolate  HIPl  from  both  AVY200  (sln2)  and  the  parental 
JD54  ( SLN2 )  strains.  Complementation  testing  of  HIP1 
from  these  strains  with  hiplA  S .  cerevisiae  showed  that 
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Fig.  4  A-C  Gap  repair-based  cloning  of  the  HIP  I  and  CIN8*  alleles 
from  sln2  cells.  Panels  A  and  B  illustrate,  respectively,  the  isolation  of 
HIPl  and  an  allele  of  CIN8  ( CIN8 *)  from  AVY200  (sln2  P GAL- 
UBR1 )  cells.  C  deletion/disruption  of  the  CIN8  gene.  See  Materials 
and  methods 


Fig.  5  Northern-hybridization  analyses  of  the  S.  cerevisiae  HIPl 
mRNA  in  different  genetic  backgrounds.  (A  and  B)  32P-labeled  HIPl 
and  ACT1  (actin)  DNA  probes  (the  latter  appropriately  diluted)  were 
used  to  determine  the  levels  of  the  corresponding  mRNAs  in 
electrophoretically  fractionated  RNA  from  strains  whose  relevant 
genetic  backgrounds  are  indicated  at  the  top  of  panels.  “pUBRl’'  and 
“pADH-UBRl”  denote  RNA  from  strains  carrying  low-copy 
plasmids  expressing  UBR1  from  either  the  natural  UBR1  promoter 
or  from  the  P Adhi  promoter.  See  also  Materials  and  methods 


the  two  HIPl  alleles  were  indistinguishable  in  their 
ability  to  restore  histidine  import  in  hiplA  cells  (data 
not  shown),  strongly  suggesting  that  sln2  cells  contained 
the  wild-type  version  of  HIPL  Quantitative  Northern 
hybridization  was  then  used  to  compare  the  levels  of 
HIPl  mRNA  in  sln2  and  congenic  SLN2  cells  in  the 
absence  and  presence  of  Ubrlp  (Fig.  5  and  see  Mate¬ 
rials  and  methods).  The  relative  level  of  HIPl  mRNA 
was  reproducibly  about  2-fold  lower  in  sln2  cells  lack¬ 
ing  Ubrlp  than  in  otherwise  identical  SLN2  cells  or  in 
Ubrlp-containing  SLN2  cells  (Fig.  5  A  and  see  Ma¬ 
terials  and  methods).  This  finding  strongly  suggested 
that  the  in  viability  of  Ubrlp-lacking  AVY200  (sln2 
^*gal-UBR1)  cells  at  the  standard  concentration  of 
histidine  in  SD  medium  was  caused  by  underexpression 
of  the  Hiplp  transporter.  Consistent  with  this  interpr¬ 
etation,  we  found  that  about  a  2-fold  increase  in  the 
HIPl  copy  number,  through  transformation  with  a 
low-copy  plasmid  expressing  HIPl  from  its  natural 
promoter,  rescued  the  in  viability  of  AVY200  cells  on 
SD  plates  (Fig.  6  A).  Note  that  the  HIS3  gene  on  a  low- 
copy  plasmid  rescued  AVY200  (sln2  PGAL-UBR1)  cells 
more  effectively  than  did  the  plasmid-borne  extra  copy 
of  HIPl  (Fig.  6  A),  again  in  agreement  with  the  ob¬ 
served  down-regulation  of  HIPl  in  AVY200  cells.  This 
finding  also  accounted,  in  hindsight,  for  the  isolation  of 
HIS3 ,  but  not  of  HIPl ,  in  our  (far  from  exhaustive) 
complementation  screen. 


Ubrlp  increases  the  expression  of  HIPl  in  sln2 
but  not  in  SLN2  cells 

The  effect  of  UBR1  expression  on  the  uptake  of  histidine 
by  Ubrlp-lacking  sln2  cells  (Fig.  3),  and  the  requirement 
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Fig.  6A-D  Effects  of  HIPl  and  Cl  NS *  on  the  growth  of  strains 
lacking  the  N-end  rule  pathway.  A  AVY200  (sln2  PGAL-UBRI)  cells 
(Table  1)  carrying  a  low-copy,  L£t/2-marked  vector  pRS315  or  the 
otherwise  identical  plasmids  expressing  either  HI  S3  or  HIP1  were 
grown  on  SD(~Leu,  +0.2  mg/ml  of  histidine)  plates  to  allow  colony 
formation  by  all  strains.  Individual  colonies  were  then  streaked  onto 
SD(-Leu)  plates.  Note  the  rescue  of  the  AVY200  (sln2  P gal-UBRI) 
cells  by  HIP l  and  their  even  stronger  rescue  by  HIS3.  B  neither  the 
CIN8 *  allele  (isolated  from  sln2  cells)  nor  the  corresponding  TRP1- 
based  vector  pRS314  rescued  inviability  of  AVY200  (sln2  PGAL- 
UBR1)  cells  on  SD  plates,  in  contrast  to  the  wild-type  CIN8  gene.  C 
cinSA  cells  (AVY202  and  AVY203,  respectively)  derived  from  either 
the  JD54  (PGALrUBRl)  or  JD52  (UBRl)  strains  (Table  1)  were 
streaked  onto  SD  plates.  Note  inviability  of  cinSA  cells  in  the  absence 
of  UBRl  expression.  Cells  were  grown  at  first  on  SD(-Ura,  +  0.2  mg/ 
ml  of  histidine)  plates  to  allow  colony  formation  by  both  strains. 
Individual  colonies  were  then  streaked  onto  SD(-Ura)  plates.  D  same 
as  in  C  but  cin8A  JD54  (cinSA  P GAL-UBRI)  cells  carried  either  the 
control  vector  pRS314  or  the  same  vector  expressing  HIS3 .  See 
Materials  and  methods 


of  Ubrlp  for  the  viability  of  these  cells  on  SD  plates 
(Fig.  1  A-C),  could  be  due  to  an  increase  of  HIP l 
expression  in  the  presence  of  Ubrlp.  Comparisons  of 
HIP l  mRNA  levels  among  sln2  strains  that  either  lacked 
Ubrlp  or  produced  increasing  amounts  of  Ubrlp 
showed  that  the  expression  of  HIP1  was  indeed  in¬ 
creased  in  the  presence  of  Ubrlp  in  sln2  cells,  but  not  to 
its  wild-type  level  (Fig.  5).  As  quantified  with  Phos- 
phorlmager  and  internal  reference,  the  increase  caused 
by  the  presence  of  Ubrlp  was,  reproducibly,  about  1.5- 
fold,  in  comparison  to  the  approximately  2-fold  increase 
that  would  have  been  required  for  restoration  of  the 
wild-type  level  of  HIP1  mRNA  (Fig.  5  and  data  not 
shown).  The  partial  restoration  of  HIP1  expression  by 
Ubrlp  in  sln2  cells  is  consistent  with  the  observed  partial 
recovery  of  histidine  uptake  (Fig.  3),  and  with  lower 
than  wild-type  growth  rates  of  Ubr lp-supplemented  sln2 
cells  (Fig.  1  B). 

We  also  asked  whether  the  presence  of  Ubrlp  could 
increase  the  expression  of  HIP l  in  SLN2  cells.  As 
shown  in  Fig.  5  A,  the  expression  of  Ubrlp,  at  in¬ 
creasing  levels,  in  SLN2  ubrlA  cells  did  not  influence 
the  expression  of  HIPl.  Taken  together,  our  results 
indicated  that  at  least  two  distinct  pathways  influence 
the  expression  of  HIPl :  a  SLY2-dependent  pathway 
and  the  UBRl -dependent  N-end  rule  pathway.  The 
latter  pathway  is  sufficient  for  the  viability-maintaining 
expression  of  HIPl  in  the  absence  of  SLN2.  Although 
the  expression  of  HIPl  was  not  extinguished  even  in 
the  absence  of  both  SLN2  and  UBRl  (Fig.  5  A),  it 
became  low  enough  to  result  in  growth  arrest  at  the 
standard  concentration  of  histidine  in  the  medium 
(Fig.  1  A). 

Previous  work  (Byrd  et  al.  1998)  has  shown  that  the 
N-end  rule  pathway  strongly  up-regulates  the  expression 
of  S.  cerevisiae  Ptr2p,  a  peptide  transporter,  through  the 
degradation  of  Cup9p,  a  transcriptional  repressor  of 
PTR2.  We  therefore  tested  whether  the  Ubrlp-depen- 
dent  increase  of  HIPl  expression  in  AVY200  (sln2  P gal- 
UBRI)  cells  (Fig.  5)  involves  the  degradation  of  Cup9p, 
which  is  known  to  either  repress  or  activate  several  yeast 
genes  (G.  Turner,  H.  Rao,  and  A.V.,  unpublished  data). 
A  sln2  cup9A  PGAL-UBR1  strain  was  constructed;  the 
levels  of  HIPl  mRNA  were  compared  between  sln2 
CUP9  and  sln2  cup9A  strains,  and  were  found  to  be 
indistinguishable  (Fig.  5  A).  Thus,  the  effect  of  the  N- 
end  rule  pathway  on  the  expression  of  HIPl  is  unlikely 
to  involve  Cup9p.  These  Northern-hybridization  data 
(Fig.  5  A)  were  also  consistent  with  the  observation  that 
a  deletion  of  CUP9  did  not  rescue  the  inviability  of 
AVY200  (, sln2  PGAL-UBRl)ce\\s  on  SD  plates  (data  not 
shown). 


SLN2  is  CIN8 

As  described  above,  both  UBRl  and  HIS3  acted  as  ex- 
tragenic  suppressors  of  the  sln2  mutation.  To  identify 
the  SLN2  gene,  we  searched  for  other  phenotypes  of 
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AVY200  ( sln2  PGAL-UBR1 )  cells,  and  found  that  they 
were  inviable  at  37  °C.  The  result  of  crossing  AVY200 
with  the  SLN2  AVY199  strain  (Table  1)  indicated  that 
the  phenotype  of  temperature  sensitivity  ( ts )  was  reces¬ 
sive  (data  not  shown).  Interestingly,  the  ts  phenotype  of 
sln2  cells  could  not  be  rescued  by  either  UBR1  (Fig.  1  D) 
or  HIS3  (data  not  shown),  indicating  that  the jln2  mu¬ 
tation  affected  not  only  histidine  uptake.  Since  the  sln2l 
SLN2  ( A VY 200/A VY 1 99)  diploid  sporulated  ineffi¬ 
ciently,  and  the  viability  of  the  few  spores  that  did  form 
was  very  low,  it  was  difficult  to  preclude,  through  tetrad 
analysis,  the  alternative  possibility  that  sln2  cells  actually 
bore  more  than  one  mutant  gene.  Therefore  we  at¬ 
tempted  to  identify  a  gene(s)  that  would  rescue  the  ts 
phenotype  of  sln2  cells,  and  thereafter  to  determine 
whether  that  gene  could  rescue  the  histidine-uptake  de¬ 
fect  as  well. 

A  standard  complementation  screen  (see  Materials 
and  methods)  yielded  CIN8 ,  a  previously  described 
gene  encoding  a  kinesin-like  protein  (Hoyt  et  al.  1992; 
Roof  et  al.  1992).  CIN8  was  found  to  alleviate  not 
only  the  ts  phenotype  of  sln2  cells,  but  their  histidine- 
uptake  defect  as  well  (Fig.  6  B),  suggesting  that  the 
sought  SLN2  gene  may  be  CIN8.  To  determine 
whether  the  AVY200  (sln2  P gal~UBR1)  strain  carried 
a  mutant  allele  of  CIN8 ,  we  isolated  it  from  these  cells 
using  the  method  of  gap  repair  (Fig.  4  B  and  see 
Materials  and  methods).  Sequencing  of  this  allele, 
termed  CIN8 *,  showed  the  presence  of  the  A  — >  T 
mutation  at  position  1237,  resulting  in  a  premature 
stop  codon  (TGA)  at  codon  413  of  the  1038-codon 
CIN8  ORF  (data  not  shown).  The  resulting  truncated 
product  would  lack  the  entire  stalk/tail  region  and  a 
portion  of  the  putative  motor  domain  (Hoyt  et  al. 
1992).  As  expected,  CJN8 *,  in  contrast  to  CIN8,  was 
unable  to  rescue  the  histidine-uptake  defect  (inviability 
on  SD  plates)  of  AVY200  cells  (Fig.  6  B).  These  re¬ 
sults  made  it  very  likely  that  the  SLN2  gene  was  in 
fact  CIN8 . 

To  produce  independent  evidence  bearing  on  this 
conjecture,  we  deleted  the  bulk  of  CIN8  from  the 
JD52  (his3A  UBR1)  and  JD54  (his3A  P GAL-UBR1) 
strains  (the  latter  strain  is  Ubrl  on  dextrose)  (see 
Fig.  4  C  and  Materials  and  methods).  The  resulting 
cin8A  strains  AVY203  (his3A  cin8A  UBRl)  and 
AVY202  (his3A  cin8A  PGAL-UBR1)  (Table  1)  were 
tested  for  viability  on  SD  plates,  i.e.,  for  comple¬ 
mentation  of  the  histidine  uptake  defect.  Whereas  the 
AVY203  (cin8A  UBR1)  cells  were  viable  under  these 
conditions,  the  AVY202  (his3A  cin8A  P GAL-UBR1) 
cells  wrere  inviable  (Fig.  6  C),  a  set  of  phenotypes  in¬ 
distinguishable  from  that  conferred  by  the  sln2  muta¬ 
tion.  Moreover,  as  would  have  be  expected  if  SLN2 
and  CIN8  were  the  same  gene,  the  HIS3  gene  was 
found  to  rescue  Ubrlp-lacking  cin8A  cells  on  SD 
plates  (Fig.  6  D),  identically  to  the  effect  of  HIS3  on 
sln2  cells  (Fig.  2  A).  Taken  together,  these  results 
(Figs.  4  and  6)  indicated  that  sln2  was  a  defective 
(truncated)  allele  of  the  CIN8  gene. 


Discussion 

We  report  the  following  results. 

(1)  A  synthetic  lethal  screen  for  S.  cerevisiae  mutants 
that  require  the  presence  of  the  N-end  rule  pathway 
(specifically,  the  presence  of  its  recognition  component 
Ubrlp)  for  viability  yielded  a  mutant  termed  sln2.  The 
viability  of  sln2  ubrl  cells  (specifically,  of  sln2  P GAL- 
UBR1  cells  on  dextrose-containing  media)  could  be 
rescued  by  UBR1  expressed  at  either  normal  or  in¬ 
creased  levels. 

(2)  The  HIS3  gene  (which  was  absent  from  the  pa¬ 
rental  “wild-type”  strain)  was  found  to  rescue  the  invi¬ 
ability  of  Ubrlp-lacking  sln2  cells.  High  (>75  pg/ml) 
concentrations  of  histidine  in  the  medium  could  also 
rescue  these  cells. 

(3)  This  defect  in  histidine  uptake,  exhibited  by 
Ubrlp-lacking  sln2  cells  but  not  by  SLN2  ubrl  A  cells, 
was  traced  to  the  HIP1  gene,  encoding  the  histidine 
transporter.  The  HIP1  allele  in  Ubrlp-lacking  sln2  cells 
was  wild  type,  but  the  level  of  HIP l  mRNA  in  these  cells 
was  reproducibly  about  2-fold  lower  than  in  parental 
(SLN2)  cells.  In  addition,  the  relative  rate  of  l4C-histi- 
dine  uptake  by  Ubrlp-lacking  sln2  HI  S3  cells  was  about 
35%  of  that  by  SLN2  HIS3  cells  and  about  50%  of  that 
by  sln2  UBR1  HI  S3  cells.  The  incomplete  shutoff  of 
histidine  import  in  Ubrlp-lacking  sln2  cells  was  consis¬ 
tent  with  their  rescue  by  high  concentrations  of  histidine 
in  the  medium.  Further,  the  incomplete  restoration  of 
histidine  import  in  Ubrlp-expressing  sln2  cells  was 
consistent  with  their  detectably  slower  growth  on  SD 
plates  in  comparison  to  either  SLN2  UBR1  or  SLN2 
ubrlA  cells.  Finally,  a  doubling  of  the  HIP1  copy 
number  in  Ubrlp-lacking  sln2  cells  partially  comple¬ 
mented  their  inviability  on  SD  plates.  Thus,  the  histi¬ 
dine-uptake  defect  of  Ubrlp-lacking  sln2  cells  was 
caused  at  least  in  part  by  underexpression  of  Hiplp. 

(4)  Yet  another  property  of  the  sin 2  mutant  was  its 
inviability  at  37  °C,  which  could  not  be  rescued  by  either 
HI  S3  or  UBRL  This  made  possible  the  cloning  of  SLN2 , 
which  was  found  to  be  a  previously  characterized  gene 
called  C1N8,  encoding  a  kinesin-like  protein  (Hoyt  et  al. 
1992;  Roof  et  al.  1992).  The  dependence  of  histidine 
transport  by  cin8A  cells  on  the  presence  of  Ubrlp  is  a 
previously  unknown  property  of  cin8A  cells. 

Cin8p  participates  in  the  assembly  of  a  bipolar 
spindle.  cin8  mutants  are  viable,  owing  to  the  presence  of 
Kiplp,  another  kinesin-related  motor  protein  and  a 
functional  homolog  of  Cin8p;  at  least  one  of  the  two 
proteins  is  required  for  cell  viability  (Hoyt  et  al.  1992; 
Roof  et  al.  1992).  Geiser  et  al.  (1997)  identified  a  num¬ 
ber  of  other  genes  that  are  required  for  viability  in  the 
absence  of  CIN8.  These  genes  encode  proteins  that 
function  in  cell-division  pathways  (spindle  motors, 
microtubule  stabilizers,  checkpoint  regulators). 

In  retrospect,  it  was  an  accident  of  the  absence  of 
HIS3  (rather  than  of  another  genetic  marker)  from  the 
parental  JD54  strain  (P GAL-UBR1)  that  led  us  to  the 
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finding,  through  a  synthetic  lethal  screen,  of  a  connec¬ 
tion  between  a  kinesin-like  protein,  the  N-end  rule 
pathway,  and  regulation  of  the  histidine  import.  Pre¬ 
liminary  tests  for  the  ability  of  Ubrlp-lacking  sln2  cells 
to  import  either  dipeptides  or  several  amino  acids  other 
than  histidine  indicated  approximately  wild-type  rates  of 
uptake  (data  not  shown). 

A  short-lived  substrate  of  Ubrlp  that  mediates  its 
influence  on  the  phenotype  of  cin8A  ( sln2 )  cells  is  un¬ 
known.  At  the  same  time,  it  is  clear  that  a  connection 
between  the  N-end  rule  pathway  and  a  Cin8p-dependent 
process  involves  the  expression  of  the  histidine  trans¬ 
porter  Hiplp.  A  parsimonious  model  which  is  consistent 
with  the  available  evidence  posits  a  short-lived  tran¬ 
scriptional  repressor  of  the  HIP1  gene  that  is  degraded 
by  the  N-end  rule  pathway,  and  is  also  down-regulated, 
in  an  unknown  manner,  through  a  Cin8p-dependent 
pathway  (Fig.  7  A).  This  model  is  analogous  to  the 
previously  deciphered  circuit  that  controls  the  expres¬ 
sion  of  the  peptide  transporter  Ptr2p  in  S.  cerevisiae.  It 
was  found  that  Cup9p,  a  homeodomain-containing 
transcriptional  repressor  of  PTR2 ,  is  a  short-lived  pro¬ 
tein  targeted  for  Ubrlp-dependent  degradation,  appar¬ 
ently  through  an  internal  degron  of  Cup9p  (Alagramam 
et  al.  1995;  Byrd  et  al.  1998).  In  the  absence  of  Ubrlp, 
the  increased  concentration  of  Cup9p  results  in  a  vir¬ 
tually  complete  shutoff  of  the  PTR2  gene  and  cessation 
of  peptide  import  (Byrd  et  al.  1998).  We  know  (see 
Results)  that  Cup9p  is  not  involved  in  the  regulation  of 
HIP1  expression. 

An  alternative  model  invokes  a  positive  regulator  of 
HIP1  that  is  positively  regulated  by  both  the  N-end  rule 
and  Cin8p-mediated  pathways  (Fig.  7  B).  A  transporter 
homolog  Ssylp  of  S.  cerevisiae  has  been  shown  to  be 
required  for  the  expression  of  several  amino-acid 
transporters,  including  Bap2p,  Bap3p  and  Tatlp,  and 
also  the  peptide  transporter  Ptr2p  (Didion  et  al.  1998). 
Ssylp  appears  to  function  at  least  in  part  as  a  sensor  of 
specific  amino  acids  in  the  medium.  Although  SSY1  is 
unnecessary  for  histidine  uptake  (Didion  et  al.  1998),  it 
is  possible  that  an  analogous  plasma  membrane-em¬ 
bedded  sensor  activates  the  synthesis  of  other  trans¬ 
porters,  including  Hiplp.  The  absence  of  Cin8p  may 
affect  the  secretory  pathway,  resulting  in  lower  amounts 
of  the  postulated  sensor  in  the  plasma  membrane. 
Ubrlp,  the  E3  component  of  the  ubiquitin-dependent  N- 
end  rule  pathway,  may  up-regulate  this  sensor;  for  ex¬ 
ample,  through  the  degradation  of  another  E3  enzyme 
that  targets  the  sensor  for  endocytosis  and  degradation 
in  the  vacuole.  The  involvement  of  the  ubiquitin  system 
in  the  regulated  endocytosis  and  lysosome/vacuole  de¬ 
livery  of  transmembrane  proteins  has  previously  been 
demonstrated  in  both  yeast  and  multicellular  eukaryotes 
(Galan  et  al.  1996;  Hicke  and  Riezman  1996). 

One  prediction  of  the  repressor-based  model  (Fig.  7 
A)  is  that  inactivation  of  the  repressor  of  HIP1  should 
suppress  the  down-regulation  of  HIP1  caused  by  the 
absence  of  both  Cin8p  and  Ubrlp.  A  genetic  screen  for 
mutants  of  this  class  is  under  way.  An  analogous  earlier 


Fig.  7  A,B  Possible  interconnections  between  the  N-end  rule  path¬ 
way,  the  regulation  of  histidine  import,  and  a  kinesin-dependent 
pathway.  A  In  this,  apparently  most-parsimonious  model,  a  short¬ 
lived  transcriptional  repressor  of  the  HI  PI  gene  is  degraded  by  the  N- 
end  rule  pathway,  and  is  also  down-regulated,  in  an  unknown 
manner,  through  a  CinSp-dependent  pathway.  B  in  an  alternative 
model,  a  positive  regulator  of  HIP l  is  positively  regulated  by  both  the 
N-end  rule  and  Cin8p-mediated  pathways 


screen  for  suppressors  of  the  inability  of  ubrlA  cells  to 
import  peptides  has  led  to  the  identification  of  Cup9p  as 
a  transcriptional  repressor  of  the  peptide  transporter 
gene  PTR2  (Byrd  et  al.  1998). 
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Analysis  of  a  conditional  degradation  signal  in  yeast  and  mammalian  cells 

Frederic  Levy*,  Jennifer  A.  Johnstonf  and  Alexander  Varshavsky 
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The  N-end  rule  pathway  is  a  ubiquitin-dependent  proteolytic  system,  the  targets  of  which  include  proteins  that  bear 
destabilizing  N-terminal  residues.  The  latter  are  a  part  of  the  degradation  signal  called  the  N-degron.  Arg-DHFRl\ 

*  an  engineered  N-end  rule  substrate,  bears  N-terminal  arginine  (a  destabilizing  residue)  and  DHFRts  [a  temperature- 

sensitive  mouse  dihydrofolate  reductase  (DHFR)  moiety].  Previous  work  has  shown  that  Arg-DHFRts  is  long-lived 
at  23  °C  but  short-lived  at  37  °C  in  the  yeast  Saccharomyces  cerevisiae.  In  the  present  work,  we  extended  this 
^  analysis,  and  found  that  the  degradation  of  Arg-DHFRts  can  be  nearly  completely  inhibited  in  vivo  by  methotrexate 

(MTX),  a  low-M,  ligand  of  DHFR.  In  S.  cerevisiae ,  Arg-DHFRts  is  degraded  at  37  °C  exclusively  by  the  N-end  rule 
pathway,  whereas  in  mouse  cells  the  same  protein  at  the  same  temperature  is  also  targeted  by  another  proteolytic 
system,  through  a  degron  in  the  conformationally  perturbed  DHFRts  moiety.  In  mouse  cells,  MTX  completely 
inhibits  the  degradation  of  Arg-DHFRls  through  its  degron  within  the  DHFRts  moiety,  but  only  partially  inhibits 
degradation  through  the  N-degron.  When  the  N-terminus  of  Arg-DHFRts  was  extended  with  a  42-residue  lysine- 
lacking  extension,  termed  eAK,  the  resulting  Arg-eAK-DHFRts  was  rapidly  degraded  at  both  23  °C  and  37  °C. 
Moreover,  the  degradation  of  Arg-eAK-DHFRts,  in  contrast  with  that  of  Arg-DHFRts,  could  not  be  inhibited  by  MTX, 
suggesting  that  the  metabolic  stability  of  Arg-DHFRts  at  23  °C  results,  at  least  in  part,  from  steric  inaccessibility  of 
its  N-terminal  arginine.  The  N-degron  of  Arg-DHFRts  is  the  first  example  of  a  portable  degradation  signal  the 
activity  of  which  can  be  modulated  in  vivo  by  a  cell-penetrating  compound.  We  discuss  implications  of  this  advance 
and  the  mechanics  of  targeting  by  the  ubiquitin  system. 

Keywords :  degron;  methotrexate;  N-end  rule;  proteolysis;  ubiquitin. 


A  number  of  regulatory  circuits,  including  those  that  control  the 
cell  cycle,  cell  differentiation  and  responses  to  stress,  involve 
metabolically  unstable  proteins  [1-6].  A  short  in  vivo  half-life 
of  a  regulator  provides  a  way  to  generate  its  spatial  gradient  and 
allows  rapid  adjustments  of  its  concentration,  or  subunit 
composition,  through  changes  in  the  rate  of  its  synthesis.  A 
protein  can  also  be  conditionally  unstable,  i.e.  long-lived  or 
short-lived,  depending  on  the  state  of  the  cell.  One  example  of 
the  latter  class  are  cyclins,  a  family  of  related  proteins  the 
destruction  of  which  at  specific  stages  of  the  cell  cycle  regulates 
cell  division  and  growth  [3,7]. 

Features  of  proteins  that  confer  metabolic  instability  are  called 
degradation  signals,  or  degrons  [8].  An  essential  component  of 
one  degradation  signal,  called  the  N-degron,  is  a  destabilizing 
N-terminal  residue  of  a  protein  [9,10].  A  set  of  N-degrons 
containing  different  N-terminal  residues  which  are  destabilizing 
in  a  given  cell  yields  a  rule,  termed  the  N-end  rule,  which  relates 
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the  in  vivo  half-life  of  a  protein  to  the  identity  of  its  N-terminal 
residue.  In  eukaryotes,  the  N-end  rule  pathway  is  a  part  of  the 
ubiquitin  (Ub)  system.  Ub  is  a  76-residue  protein  whose 
covalent  conjugation  to  other  proteins  plays  a  role  in  a  multitude 
of  processes,  including  cell  growth,  differentiation  and  responses 
to  stress  [2.4,1 1-13],  In  many  of  these  settings,  Ub  acts  through 
routes  that  involve  the  degradation  of  Ub-protein  conjugates  by 
the  26S  proteasome,  an  ATP-dependent  multisubunit  protease 
[14,15]. 

In  eukaryotes,  linear  Ub-protein  fusions  are  rapidly  cleaved 
by  Ub-specific  proteases  (UBPs)  at  the  Ub-protein  junction, 
making  possible  the  production  of  otherwise  identical  proteins 
bearing  different  N-terminal  residues,  a  technical  advance  that 
led  to  the  discovery  of  the  N-end  rule  [9,10].  When  mouse 
dihydrofolate  reductase  (DHFR),  a  20-kDa  monomeric  protein, 
was  expressed  as  a  Ub-Arg-DHFR  fusion  in  the  yeast  Sac¬ 
charomyces  cerevisiae  at  30  °C,  the  resulting  Arg-DHFR  (pro¬ 
duced  through  deubiquitylation)  was  long-lived  (r0  5  >  4  h), 
even  though  it  bore  N-terminal  Arg,  a  destabilizing  residue  [16]. 
By  contrast,  a  modified  protein,  Arg-eK-DHFR  (produced  from 
Ub-Arg-eK-DHFR),  which  bore  a  42-residue,  lysine-containing 
extension  (denoted  as  eK)  between  the  N-terminal  Arg  residue 
and  the  first  (Val)  residue  of  DHFR,  was  short-lived  in  vivo  (t()5 
of  ~  10  min),  being  degraded  by  the  N-end  rule  pathway  [16], 

This  and  other  evidence  indicated  that  the  N-degron  com¬ 
prises  two  essential  determinants:  a  destabilizing  N-terminal 
residue  and  an  internal  Lys  residue  (or  residues)  of  a  substrate. 
The  Lys  residue  is  the  site  of  attachment  of  a  multi- Ub  chain 
[16-18].  Arg-eK-DHFR  contains  a  complete  N-degron  because 
it  bears  both  an  N-terminal  destabilizing  residue  (Arg)  and,  in 
the  eK  extension,  at  least  one  targetable  Lys  residue.  In  contrast, 
the  N-degron  of  Arg-DHFR  is  incomplete,  for  at  least  one  of  the 
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two  mutually  non-exclusive  reasons:  the  absence  of  Lys  residues 
accessible  to  the  targeting  complex  of  the  N-end  rule  pathway 
and/or  poor  accessibility  of  the  N-terminal  Arg  in  Arg-DHFR  to 
the  same  targeting  complex.  Since  the  20-kDa  mouse  DHFR 
contains  16  Lys  residues,  this  interpretation  presumes  that,  in  a 
folded  DHFR  molecule,  the  lysines  are  ineffective  as  ubiquity- 
lation  sites,  because  of  the  lysine’s  lack  of  mobility  and/or  its 
distance  from  a  destabilizing  N-terminal  residue  [10]. 

Previous  work  [19]  described  a  temperature-sensitive  ( ts ) 
allele  of  DHFR  that  was  used  to  construct  a  heat-inducible 
N-degron,  in  a  Ub-Arg-DHFRts-Ura3p  fusion  which  contained 
S.  cerevisiae  Ura3p  as  a  C-terminal  reporter.  The  Ub-Arg- 
DHFRls  moiety  of  this  fusion  was  identical  with  Ub-Arg-DHFR 
above,  except  for  a  Pro  — >  Leu  alteration  at  position  66  of  the 
^  DHFR  moiety.  Arg-DHFR,S-Ura3p  was  long-lived  at  23  °C  but 
short-lived  at  37  °C  in  yeast.  Moreover,  the  Ub-Arg-DHFRts 
module  of  Ub-Arg-DHFRts-Ura3p  was  shown  to  be  portable,  in 
that  linking  it  to  a  protein  of  interest  conferred  heat-inducible 
metabolic  instability  on  the  entire  fusion  [19]. 

In  addition  to  yielding  a  new  method  for  the  construction  of  ts 
mutants,  the  heat-inducible  N-degron  [19]  provided  an  approach 
to  analyzing  the  mechanism  of  targeting  by  the  N-end  rule 
pathway.  In  carrying  out  this  analysis,  described  below,  we  have 
found  that  the  degradation  of  Arg-DHFRts  can  be  inhibited  in 
vivo  by  methotrexate  (MTX),  a  low-M,  ligand  of  DHFR.  We 
also  show  that,  in  mouse  cells,  Arg-DHFRts  is  targeted  by  both 
the  N-end  rule  pathway  and  another  proteolytic  system,  whereas 
in  yeast  the  same  test  protein  is  exclusively  an  N-end  rule 
substrate.  The  N-degron  of  Arg-DHFRts  is  the  first  example  of  a 


portable  degradation  signal  whose  activity  can  be  modulated  in 
vivo  by  a  cell-penetrating  compound. 

MATERIALS  AND  METHODS 

Plasmids  for  expression  in  S.  cerevisiae 

The  plasmid  pJJRtd,  which  expressed  Ub-Arg-DHFRts-ha 
(Fig.  1,  construct  I),  was  based  on  the  vector  pRS316  [20].  A 
1.0-kb  EcoRl/HindUl  fragment  of  pPW58  [8]  was  ligated  into 
pRS316,  followed  by  the  cloning  of  the  PCupi  promoter- 
containing  0.4  kb  EcoRVSac]  fragment  of  pPW48  [19]  into  the 
EcoRl  site,  a  step  that  yielded  pJJRtd.  pJJMtd,  which  expressed 
Ub-Met-DHFR,s-ha,  was  prepared  by  ligating  the  large  Age  1/ 
AflW  fragment  of  pJJRtd  to  a  small  fragment  from  the  AgeVAflW 
digest  of  pLGUbM-DHFRha  [21]. 

Plasmids  for  expression  in  mammalian  cells 

Mouse  DHFR  and  DHFRts  open  reading  frames  bearing  the 
desired  restriction  sites  at  their  5'  and  3'  ends  were  produced 
using  PCR.  All  DHFR  and  DHFRts  moieties  were  flanked,  at 
their  C-termini,  with  the  ha  epitope  [22]  (Fig.  1),  and  were 
cloned  into  the  pRc/CMV  vector  (Invitrogen,  San  Diego,  CA, 
USA). 

Constructs  II  and  III  (Fig.  1)  were  produced  by  PCR 
amplification  of  a  fragment  encoding  Met-DHFRts  and  Arg- 
DHFR^,  respectively,  using  construct  I  as  a  template.  The  N- 
terminal  Met  of  construct  II  was  specified  by  the  PCR  primer 


Fig.  1.  Test  proteins.  Fusions  used  in  this  work 
contained  some  of  the  following  moieties:  the  Ub 
moiety,  either  N-terminal  (constructs  I,  IV-VII) 
or  placed  between  other  moieties  (constructs  II 
and  III):  an  Arg  residue  at  the  junction  between 
Ub  and  the  C-terminal  part  of  the  fusion 
(constructs  I,  III,  V  and  VII);  a  Met  residue  at  the 
same  junction  (constructs  II,  IV,  and  VI);  a  42- 
residue  E.  colt  Lac  repressor-derived  sequence, 
termed  eAK  [extension  (e)  lacking  lysines  (AK)], 
between  Ub  and  the  reporter  part  of  the  fusion  (see 
the  main  text  and  [22,27])  (constructs  IV-VII); 
DHFRha,  a  mouse  DHFR  moiety  extended  at  the 
C-terminus  by  a  sequence  containing  the  hemag¬ 
glutinin-derived  ha  epitope  (constructs  I- VII); 
DHFR'\  a  mutant  temperature-sensitive  DHFR 
moiety  that  differs  from  DHFR  by  the  Pro  — *  Leu 
alteration  at  position  66  [19]  (see  the  main  text). 
Amino  acid  residues  are  indicated  by  their  single¬ 
letter  abbreviations.  Residues  that  vary  between 
constructs  are  boxed  in  the  sequences  above  the 
diagrams. 
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annealing  to  the  5'  end  of  the  fragment.  The  PCR  products  were 
digested  with  Xbal ,  blunted  with  Klenow  Pol  I,  followed  by  a 
cut  with  Sacll.  A  fragment  bearing  an  open  reading  frame  was 
inserted  between  the  blunt-ended  Notl  site  and  the  Sacll  site  of 
the  plasmid  pRc/dhaUbMbgal  [23],  resulting  in  the  plasmids 
pRc/dha-Ub-Met-DHFRts  and  pRc/dha-Ub-Arg-DHFRA  Met- 
and  Arg-DHFRts  contained  the  six-residue  sequence  His-Gly- 
Ser-Gly-Ue-Met  between  the  N-terminal  residue  and  Val,  the 
first  natural  residue  of  DHFR  (Fig.  1). 

To  produce  constructs  IV- VII  (Fig.  1),  a  fragment  encoding 
Arg-DHFRls  was  amplified  by  PCR,  using  construct  I  as 
template.  The  PCR  product  was  digested  with  Xbal,  and  blunted 
with  Klenow  Pol  I,  followed  by  a  Sacll  cut.  The  resulting 
fragment  was  inserted  between  the  blunt-ended  Notl  site  and  the 
Sacll  site  of  pRc/UbS(3gal,  yielding  pRc/Ub-Arg-DHFRts.  The 
latter  served  as  a  vector  for  the  constructs  IV- VII.  Constructs 
IV  and  V  were  obtained  by  amplifying  fragments  encoding, 
respectively,  Met-eAK-DHFRls  and  Arg-eAK-DHFRls,  using  the 
plasmid  pLG/UbV76-Val-eAKDHFRha  [22]  as  a  template.  The 
N-terminal  residues  Met  and  Arg  were  specified  by  a  PCR 
primer  annealing  to  the  5'  end  of  the  fragment.  Digestion  of  the 
PCR  products  with  Sacll  and  PflMl  yielded  a  fragment  coding 


for  Met-  or  Arg-eAK  and  the  first  39  residues  of  DHFR 
(DHFRi_39).  ( PflMl  cleaves  the  DHFRts  open  reading  frame 
upstream  of  the  mutated  codon  at  position  66.)  This  fragment 
was  inserted  in-frame  between  the  Sacll/ PflMl  sites  of  the 
plasmid  pRc/Ub-Arg-DHFRts,  yielding  plasmids  expressing  Ub- 
Met-eAK-DHFRts  and  Ub-Arg-eAK-DHFRls.  Constructs  VI  and 
VII  were  produced  by  exchanging  a  SnaBUPflml  fragment  of 
pRc/dhaUbNe  KDHFRha  with  the  SnaBUPflMl  fragments 
derived  from  constructs  IV  and  V,  respectively.  This  step 
eliminated  the  fragment  encoding  dha-Ub-Asn-eK-DHFR|^i3  of 
the  original  plasmid,  replacing  it,  in-frame,  with  the  sequence 
encoding  either  Ub-Met-eAK-DHFR ,  _  1 3  or  Ub-Arg-eAK- 
DHFRj  13.  (The  plasmid  pRc/dhaUbNe^DHFRha  was  con¬ 
structed  by  replacing  the  region  encoding  (3gal  in  pRc/ 
dhaUbX(3gal  [23]  with  the  region  encoding  DHFR.) 

Yeast  and  mouse  cell  cultures,  transfection,  and  pulse-chase  7. 
analysis 

S.  cerevisiae  were  grown  and  manipulated  as  described 
previously  [24]  and  in  the  legend  to  Fig.  2.  Mouse  L  cells,  a 
fibroblast-like  cell  line  (ATCC  CCL  1.3,  American  Type 
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Fig.  2.  Inhibition  of  Arg-DHFRts  degradation  by  MTX  in  S.  cerevisiae.  (A)  Lane  M,  molecular  mass  markers.  Lanes  a-c,  JD47-13c  ( UBR1 )  cells  expressing 
Arg-DHFR,S  (Ub-Arg-DHFR,S)  were  labeled  for  4  min  at  23  °C  with  [33S]methionine/cysteine,  followed  by  a  chase  for  10  and  30  min  as  described  [40],  then 
extraction,  immunoprecipitation,  and  Tricine-based  SDS/PAGE  (12%  gel)  of  Arg-DHFRts  [41].  Lanes  d-f,  same  as  lanes  a-c,  except  that  immediately  after  the 
4-min  pulse  cells  were  shifted  to  37  °C,  and  the  chase  was  carried  out  at  37  °C.  Lanes  g-i,  same  as  lanes  d-f,  but  an  independent  pulse-chase  experiment.  Lanes 
j-1,  same  as  lanes  g-i,  but  with  JD55  ( ubrlA )  S.  cerevisiae.  Lanes  m-o,  same  as  lanes  g-i,  but  with  KMY613  ( ubc2A )  S.  cerevisiae  strain  expressing  the 
plasmid-borne  UBC2  gene  [42].  Lanes  p-r,  same  as  lanes  m-o,  but  with  KMY613  ( ubc2A )  lacking  the  plasmid-borne  UBC2.  Lanes  s-u,  JD47-13c  ( UBR1 )  cells 
expressing  Arg-DHFR,s-ha-Cdc28p  (Ub-Arg-DHFR,s-ha-Cdc28p)  [19]  were  labeled  and  processed  as  described  for  lanes  d-f.  Lanes  v-x,  same  as  lanes  s-u,  but 
with  MTX  added  to  the  growth  medium  to  a  final  concentration  of  20  p,M  30  min  before  the  pulse.  (B)  Lane  M,  molecular  mass  markers.  Lanes  a-c,  JD47-13c 
( UBR1 )  cells  expressing  Met-DHFR,S  (Ub-Met-DHFRts)  were  labeled  for  4  min  at  23  °C  with  [35S] methionine/cysteine.  Immediately  thereafter,  the  cells  were 
shifted  to  37  °C,  followed  by  a  chase  at  37  °C  for  10  and  30  min,  extraction,  immunoprecipitation,  and  SDS/PAGE  of  Met-DHFRts.  Lanes  d-f,  same  as  lanes  a- 
c,  but  with  Arg-DHFRts.  Lanes  g-i,  same  as  lanes  a-c,  but  with  MTX  added  to  the  growth  medium  30  min  before  the  pulse  to  a  final  concentration  of  20  jxm. 
Lanes  j-I,  same  as  lanes  g-i,  but  with  Arg-DHFRts.  Lanes  m-o,  same  as  lanes  a-c,  except  that  both  the  pulse  and  the  chase  were  at  37  °C.  Lanes  p-r,  same  as 
lanes  m-o,  but  with  Arg-DHFR,S.  Lanes  s-u,  same  as  lanes  m-o,  but  with  MTX  added  to  the  growth  medium  30  min  before  the  pulse  to  a  final  concentration  of 
20  pM.  Lanes  v-x,  same  as  lanes  s-u,  but  with  Arg-DHFRts. 
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Culture  Collection,  Rockville,  MD,  USA),  were  grown  as 
monolayers  in  Dulbecco’s  modified  Eagle’s  medium/F12 
medium,  supplemented  with  10%  fetal  bovine  serum,  antibiotics, 
2  niM  L-glutamine  and  20  mM  Hepes  (sodium  salt;  pH  7.3).  The 
cultures  were  regularly  checked  for  the  absence  of  mycoplasmas. 
Transient  transfections  and  pulse-chase  analyses  were  per¬ 
formed  as  described  [23].  For  the  pulse-chases  at  23  °C,  cells 
were  transferred  to  this  temperature  30  min  before  labeling  for 
10  min  with  0.1  mCi  (1  Ci  =  37  GBq)  of  [3r,S]EXPRESS  (New 
England  Nuclear,  Boston,  MA,  USA).  Cells  were  chased  at 
either  23  °C  or  37  °C  in  the  complete  medium  containing 
0.1  mg-mL"1  cycloheximide.  All  media  were  pre-equilibrated  at 
the  desired  temperature.  In  experiments  that  involved  MTX,  the 
drug  was  added  30  min  before  labeling  and  was  present  at  a 
final  concentration  of  20  jxm  (diluted  from  20  mM  MTX  in 
0.9  m  potassium/sodium  phosphate,  pH  7.3).  The  samples  were 
analyzed  by  SDS/PAGE  (15%  gel),  followed  by  autoradio¬ 
graphy  and  quantitation  using  a  Phosphorlmager  (Molecular 
Dynamics,  Sunnyvale,  CA,  USA). 

To  distinguish  between  partial  half-lives  that  approximate  the 
slopes  of  different  regions  of  a  non-exponential  decay  curve,  a 
generalized  half-life  term,  5',  was  used,  where  0.5  denotes  the 
parameter’s  half-life  aspect  and  y-z  denotes  the  relevant  time 
interval,  from  y  to  z  min  of  chase.  Another  term  [23],  initial 
decay  (ID),  expressed  as  a  percentage,  equals  1  minus  the  ratio 
of  the  amount  of  a  radiolabeled  X-DHFR  test  protein  (normal¬ 
ized  against  the  reference  protein,  Met-DHFR-Ub;  Fig.  1 )  at  the 
end  of  the  pulse  (time  0)  to  the  (normalized)  amount  of  a 
radiolabeled  long-lived  X-DHFR  such  as  Met-DHFR.  As  ID 
may  depend  on  the  duration  of  a  pulse,  a  superscript,  in  ID', 
invokes  the  pulse  time  explicitly  [23]. 

RESULTS 

Inhibition  of  Arg-DHFRts  degradation  by  MTX  in 
S.  cerevisiae 

The  major  test  protein  of  this  work  was  Arg-DHFRts  (produced 
from  Ub-Arg-DHFR,S)  (Fig.  1,  construct  I),  identical  with  the 
Arg-DHFRts  moiety  of  a  larger  fusion  described  in  the  intro¬ 
duction.  The  test  proteins  bore  C-terminal  ha ,  a  hemagglutinin- 
derived  epitope  tag  recognized  by  a  monoclonal  antibody  [22] 
(Fig.  1 ;  the  ha  epitope  is  omitted  below  in  the  names  of  specific 
test  proteins).  For  the  analysis  in  S.  cerevisiae ,  Arg-DHFRts  was 
expressed  from  the  copper-inducible  P cun  promoter  in  a  low- 
copy  vector,  and  the  metabolic  fate  of  Arg-DHFRts  was  deter¬ 
mined  in  a  pulse-chase  assay  (see  Materials  and  methods). 
Similarly  to  the  previously  examined  Arg-DHFRts-Ura3p  and 
Arg-DHFRts-Cdc28p  [19],  Arg-DHFRls  was  a  long-lived  protein 
at  23  °C  (t0.5  >  4  h)  but  a  short-lived  protein  at  37  °C  (t{)5  ~ 
4  min)  (Fig.  2A,  lanes  a-c  vs.  d-f).  The  degradation  of  Arg- 
DHFRls  at  37  °C  was  carried  out  by  the  N-end  rule  pathway,  as 
indicated  by  (a)  the  requirement  for  Ubrlp  (E3),  the  recognition 
component  of  this  pathway  (Fig.  2A,  lanes  g-i  vs.  j-1),  (b)  the 
requirement  for  Ubc2p  (E2),  the  relevant  Ub-conjugating  enzyme 
(Fig.  2A,  lanes  m-o  vs.  p-r),  and  (c)  the  long  half-life  of  Met- 
DHFRts,  an  otherwise  identical  protein  that  bore  N-terminal 
Met,  a  stabilizing  residue,  instead  of  Arg  (Fig.  2B,  lanes  a-c  vs. 

d-f). 

Previous  work  [27]  has  shown  that  the  degradation  of  Arg-eK- 
DHFR  by  the  N-end  rule  pathway  in  an  extract  from  rabbit 
reticulocytes  can  be  inhibited  by  the  folate  analog  MTX,  a  high- 
affinity  DHFR  ligand  (Kd  «  10  pM).  It  was  far  from  clear 
whether  MTX  would  have  the  same  effect  in  vivo ,  in  part 
because  the  purified  N-end  rule  substrates  added  to  the  extract 


contained  the  folded  DHFR  moieties  [27].  By  contrast,  a  nascent 
DHFR-based  substrate  is  unable  to  bind  MTX  until  after  the 
folding  of  DHFR  [28],  but  can  be  targeted  by  the  N-end  rule 
pathway  in  vivo  at  any  time,  possibly  even  during  translation 
[16]. 

The  metabolic  fate  of  Arg-DHFRts  at  37  °C  was  monitored  in 
5.  cerevisiae  in  the  absence  or  presence  of  20  jxm  MTX  in  the 
medium.  MTX  was  found  to  be  an  efficacious  inhibitor  of  Arg- 
DHFRls  degradation  in  S.  cerevisiae  (Fig.  2B,  lanes  d-f  and  p-r 
vs.  j-1  and  v-x).  MTX  inhibited  the  degradation  of  Arg-DHFRts 
regardless  of  whether  it  was  added  to  the  growth  medium 
15  min  or  24  h  before  the  pulse-chase  assay  (data  not  shown; 
the  growth  of  cells  was  not  affected  significantly  by  20  jxm 
MTX  for  at  least  24  h).  The  protective  effect  of  MTX  on  Arg- 
DHFRts  was  essentially  the  same  regardless  of  whether  the  pulse 
labeling  (followed  by  the  37  °C  chase)  was  carried  out  at  23  °C 
(Fig.  2B,  lanes  d-f  vs.  j-1)  or  at  37  °C  (Fig.  2B,  lanes  p-r  vs. 
v-x). 

The  degradation  (at  37  °C)  of  Arg-DHFRts-Cdc28p,  which 
differed  from  Arg-DHFRts  (produced  from  Ub-Arg-DHFRls)  by 
the  presence  of  the  Cdc28p  kinase  moiety,  was  also  inhibited  by 
MTX  (Fig.  2A,  lanes  s-u  vs.  v-x).  Arg-DHFRls-Cdc28p,  which 
was  long-lived  at  23  °C,  or  at  37  °C  in  uhrlA  cells,  could 
function  as  a  Cdc28p  kinase  in  vivo  [19].  As  Arg-DHFRls- 
Cdc28p  was  short-lived  at  37  °C  in  UBRJ  S.  cerevisiae 
(Fig.  2A,  lanes  s-u),  the  cells  that  also  carried  a  mutant  (ts) 
cdc28  allele  [19]  were  not  viable  at  37  °C,  owing  to  the  absence 
of  the  essential  Cdc28p  kinase  at  this  temperature.  The  addition 
of  MTX  (at  20  |xm)  was  found  to  rescue  these  cells  at  37  °C 
through  the  inhibition  of  degradation  of  Arg-DHFRls-Cdc28p 
(Fig.  2A,  lanes  s-u  vs.  v-x,  and  data  not  shown).  The  effect  of 
MTX  was  specific  for  DHFR-containing  substrates,  as  degrada¬ 
tion  of  unrelated  test  proteins,  for  example  Arg-(3gal  (Ub-Arg- 
Pgal),  by  the  N-end  rule  pathway  was  unimpaired  in  the 
presence  of  MTX  (data  not  shown). 

DHFR  as  an  N-end  rule  substrate  in  mammalian  cells 

DHFR-based  substrates  of  the  N-end  rule  pathway  were  used  in 
the  earlier  studies  with  S .  cerevisiae  [16,19],  but  have  not  been 
examined  in  mammalian  cells.  We  used  L  cells,  a  mouse 
fibroblast-like  cell  line,  and  the  ubiquitin/protein/reference 
(UPR)  technique,  which  increases  the  accuracy  of  pulse-chase 
assays  [26]  by  providing  a  reference  protein  [23].  In  the  UPR 
technique,  Ub  is  located  between  a  protein  of  interest  and  a 
reference  protein  in  a  linear  fusion  (e.g.  construct  II  in  Fig.  1). 
This  fusion  is  cleaved,  cotranslationally  or  nearly  so,  by  UBPs  at 
the  last  residue  of  Ub,  producing  equimolar  amounts  of  the 
protein  of  interest  and  the  reference  protein  bearing  a  C-terminal 
Ub  moiety.  If  both  the  reference  protein  and  the  protein  of 
interest  are  immunoprecipitated  in  a  pulse-chase  assay,  the 
relative  amounts  of  the  protein  of  interest  can  be  normalized 
against  the  reference  in  the  same  sample.  The  UPR  technique 
can  thus  compensate  for  the  scatter  of  immunoprecipitation 
yields,  sample  volumes  and  other  sources  of  sample-to-sample 
variation  [23]. 

The  UPR  constructs  of  the  pm  ~>t  work  were  fusions 
containing  the  metabolically  stable  aPc^  HFR-Ub  moiety  as  a 
reference  protein  (termed  dha-T  f  4  ^FR-based  N-end 
rule  substrate  such  as,  for  exam}.  as  a  protein  of 

interest  (Fig.  1,  construct  III).  To  t  ssibility  that 

the  C-terminal  Ub  moiety  of  dha-L  "  function  as  a 
ubiquitylation/degradation  signal,  the  Ly.  residue  of  Ub  (a 
major  site  of  isopeptide  bonds  in  multi-Ub  chains  [17,22,29]) 
was  converted  into  Arg,  which  cannot  be  ubiquitylated  [12]. 
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Fig.  3.  Inhibition  of  Arg-DHFRts  degradation 
in  mouse  cells  by  MTX.  (A)  Lanes  a-d,  mouse  L 
cells  transiently  expressing  a  UPR-based  version 
of  Met-DHFRts  (Met-DHFR-ha-UbR4S-Met- 
DHFRts-ha;  see  Fig.  1,  construct  II)  were  labeled 
at  23  °C  for  10  min  with  [35S]methionine/ 
cysteine,  followed  by  a  chase,  also  at  23  °C, 
for  30,  60  and  120  min  in  the  presence  of 
cycloheximide,  extraction,  immunoprecipitation, 
and  SDS/PAGE  of  Met-DHFRls-ha  and 
Met-DHFR-ha-UbR48.  Lanes  e-h,  same  as  lanes 
a-d  but  with  Arg-DHFRls  (Met-DHFR-ha- 
UbR48-Arg-DHFRls-ha).  (B)  Lanes  a-d,  same  as 
lanes  a-d  in  (A),  except  that  both  the  pulse  and 
the  chase  were  at  37  °C.  Lanes  e-h,  same  as  lanes 
a-d,  but  with  Arg-DHFR's-ha  (Met-DHFR-ha- 
UbR48-Arg-DHFRls-ha).  (C)  Lanes  a-d,  same  as 
lanes  a-d  in  (B)  but  with  20  jxm  MTX  in  the 
medium.  Lanes  e-h,  same  as  lanes  a-d,  but  with 
Arg-DHFRts  (Met-DHFR-ha-UbR48-Arg-DHFR,s- 
ha).  The  bands  of  Arg/Met-DHFR,s-ha  and 
Met-DHFR-ha-UbR4X  (denoted  as  dha-Ub)  are 
indicated.  A  larger  area  of  the  gel  is  shown  for 
lanes  a-h  in  (B),  to  illustrate  the  overall 
immunoprecipitation  pattern. 


These  fusions  were  expressed  from  the  cytomegalovirus  early 
promoter,  P CMV,  in  mouse  L  cells.  The  cells  were  labeled  for 
10  min  at  either  23  °C  or  37  °C  with  [35S] methionine/cysteine, 
followed  by  a  chase  of  30,  60  and  120  min  at  the  same  tem¬ 
perature  in  the  presence  of  cycloheximide,  immunoprecipitation 
with  anti-ha  monoclonal  antibody,  and  analysis/quantitation  of 
immunoprecipitated  proteins  by  SDS/PAGE  and  Phosphor- 
Imager,  using  UPR  (see  Materials  and  methods).  Pulse-chase 
assays  in  which  cycloheximide  (a  translation  inhibitor)  was 
omitted  yielded  similar  results  (data  not  shown). 

Both  Met-DHFRts  and  Arg-DHFRts  were  long-lived  in  mouse 
cells  at  23  °C  (/().5  >  10  h)  (Fig.  3 A,  lanes  a-h,  and  Fig.  4A). 
By  contrast,  Arg-DHFRts  was  short-lived  at  37  °C  in  mouse 
cells:  its  half-life,  determined  between  0  and  60  min  of  chase 
and  denoted  as  fo.s°"6°  [23],  was  ~  10  min  (Fig.  3B,  lanes  e-h, 
and  Fig.  4A).  In  addition,  a  large  fraction  (*»  30%)  of  the  pulse- 
labeled  Arg-DHFRts  (but  not  of  Met-DHFRts)  was  degraded 
during  the  10-min  pulse  at  37  °C,  as  could  be  seen  from 
comparing  the  UPR-normalized  amounts  of  Arg-DHFRts  at  time 
0  (the  beginning  of  the  chase)  at  23  °C  and  37  °C  (Fig.  4A).  In 
contrast  with  a  conventional  pulse-chase  assay,  the  use  of  UPR 
in  a  setting  where  a  protein  can  be  made  either  short-lived  or 
long-lived  allows  the  detection  and  measurement  of  proteolysis 
not  only  during  the  chase  but  during  the  pulse  as  well  [23].  The 
extent  of  degradation  of  a  protein  during  the  pulse  is  denoted  as 
IDV  [23].  In  the  present  context,  the  variable  IDV  is  defined  as  the 
extent  of  degradation  of  a  radiolabeled  short-lived  protein  (Arg- 
DHFRts  at  37  °C)  at  the  end  of  a  pulse  relative  to  the  amount  of 
a  nearly  identical  protein  (Met-DHFRts  at  37  °C)  that  is  long- 
lived  under  the  same  conditions. 

A  large  fraction,  but  not  all,  of  the  Arg-DHFRts  degradation  in 
mouse  cells  was  earned  out  by  the  N-end  rule  pathway.  This 
could  be  seen  by  comparing  the  UPR-based  decay  curves  of 
Met-DHFRts  and  Arg-DHFRts  at  37  °C  (Fig.  3B  lanes  a-d  vs. 
lanes  e-h  and  Fig.  4A).  Specifically,  not  only  Arg-DHFRts  but 
also  Met-DHFRts  were  metabolically  unstable  at  37  °C  in  mouse 
cells,  the  corresponding  /fe60  being  «  1 0  min  and  ~  48  min, 
respectively  (Fig.  4A).  As  Met  is  a  stabilizing  residue  in  the 


N-end  rule  [23,30],  we  infer  that  Arg-DHFRts  was  targeted  not 
only  by  the  N-end  rule  pathway,  but  also  by  another  proteolytic 
system,  which  recognized  a  structural  feature  that  resulted  from 
a  conformational  perturbation  of  the  DHFRts  moiety  at  37  °C. 
This  aspect  of  Arg-DHFRts  acted  as  a  degron  in  mouse  cells  but 
not  in  yeast,  inasmuch  as  the  same  protein  was  targeted 
exclusively  by  the  N-end  rule  pathway  in  S.  cerevisiae  at  37  °C 
(Fig.  2A  lanes  g-i  vs.  lanes  j-1).  The  nature  of  this  degradation 
signal  in  the  conformationally  perturbed  (at  37  °C)  DHFRts 
moiety  is  not  known. 

Inhibition  of  both  degradation  signals  of  Arg-DHFRts-ha  by 
MTX  in  mouse  cells 

Mouse  L  cells  expressing  Met-DHFRts  (dha-Ub-Met-DHFRts)  or 
Arg-DHFRts  (dha-Ub-Arg-DHFRts)  were  incubated  at  37  °C  for 
30  min  with  20  jxm  MTX  in  the  medium,  followed  by  a  10-min 
pulse  with  [35S]methionine/cysteine,  a  chase,  and  SDS/PAGE 
analysis.  The  degradation  of  Met-DHFRts  at  37  °C,  which  is 
mediated  by  a  non-N-degron  in  the  conformationally  perturbed 
DHFRts  moiety  (see  above),  was  virtually  completely  inhibited 
by  MTX  (Fig.  3B  lanes  a-d  vs.  Fig.  3C  lanes  a-d  and  Fig.  4B). 
Specifically,  the  t$ '560  of  Met-DHFRts  at  37  °C  was  ~  48  min  in 
the  absence  of  MTX  and  >  1 0  h  in  the  presence  of  MTX 
(Fig.  4B).  The  much  more  rapid  degradation  of  Arg-DHFRts 
carried  out  by  both  the  N-end  rule  pathway  and  the  other 
proteolytic  pathway  was  strongly  but  incompletely  inhibited  by 
MTX  (Fig.  3B  lanes  e-h  vs.  Fig.  3C  lanes  e-h  and  Fig.  4B). 
Specifically,  in  the  absence  of  MTX,  the  rfe60  of  Arg-DHFRls 
was  ~  10  min  (ID10  of  36%)  (Fig.  4B).  In  the  presence  of 
MTX,  these  variables  changed  to  t^0  ~  37  min  and  ID10  of 
29%  (Fig.  4B). 

A  comparison  of  the  corresponding  decay  curves  (Fig.  4), 
made  more  reliable  by  the  increased  accuracy  of  a  UPR-based 
pulse-chase  assay,  suggested  one  reason  for  the  leaky  inhibition 
of  the  N-degron  of  Arg-DHFRts  by  MTX.  Specifically,  the  ID10 
of  Met-DHFRts  (i.e.  the  extent  of  Met-DHFRts  degradation 
during  the  pulse)  was  <  10%  in  the  absence  of  MTX,  whereas 
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Fig.  4.  Decay  curves  of  Met-DHFRts  and  Arg-DHFRts  at  23  °C  and 
37  °C  in  mouse  cells,  in  the  absence  (A)  and  presence  of  MTX  (B).  These 
curves  were  derived  from  the  UPR-based  electrophoretic  data  (Figs  3  and  5 
and  analogous  evidence),  as  described  in  Materials  and  methods. 


the  ID10  of  Arg-DHFRts  under  the  same  conditions  was  ~  36%. 
In  other  words,  more  than  one-third  of  labeled  Arg-DHFRls 
molecules  were  degraded  during  the  10-min  pulse  (Fig.  4B). 
Moreover,  the  ID10  of  Arg-DHFRts  remained  high  (~  30%) 
even  in  the  presence  of  MTX  (Fig.  4B).  This  result  suggested 
that  a  large  fraction  of  the  newly  formed  Arg-DHFRLs  molecules 
was  targeted  for  degradation  by  the  N-end  rule  pathway  before 
the  conformation  of  Arg-DHFR,S  was  mature  enough  to  allow 
high-affinity  binding  of  MTX  (see  the  Discussion). 

An  exposed  N-terminus  renders  Arg-DFIFRts  constitutively 
short-lived 

To  address  the  relative  contributions  of  the  first  (N-terminal 
Arg)  and  the  second  (internal  Lys)  determinant  of  the  N-degron 
in  Arg-DHFRts,  we  extended  its  N-terminus  with  a  42-residue, 
Escherichia  coli  Lac  repressor-derived  sequence,  termed  e 
[extension  (e)  lacking  lysines  (AK)]  [16].  Met-eAK-DHFRts  (Ub- 
Met-eAK-DHFRts)  (Fig.  1,  construct  IV),  which  bore  a  stabiliz¬ 
ing  N-terminal  residue,  was  long-lived  at  23  °C  in  mouse  cells 
(//)L60  >  10  h)  (Fig.  5 A,  lanes  a-c).  Arg-eAK-DHFRls  (Ub-Arg- 
eA^-DHFRts)  (Fig.  1,  construct  V),  which  bore  a  destabilizing 
N-terminal  residue,  was  short-lived  at  23  °C  (fo.560  ~  45  min; 
Fig.  5A,  lanes  d-f),  in  contrast  with  the  otherwise  identical  Arg- 
DHFRls  that  lacked  the  eAK  extension  and  was  long-lived  at 
23  °C  (Fig.  3A,  lanes  e-h).  Note  that  Arg-eAK-DHFRts  was 
short-lived  at  23  °C,  in  spite  of  the  fact  that  the  eAK  extension 
did  not  contribute  additional  Lys  residues  to  Arg-DHFRts,  which 
was  long-lived  at  23  °C. 

At  37  °C,  Met-eAK-DHFRts  was  approximately  as  short-lived 
as  Met-DHFRES;  the  degradation  of  these  proteins  was  carried 
out  by  a  non-N-end  rule  pathway  that  targeted  a  degron  (dx)  in 
the  conformationally  perturbed  DHFRts  moiety  (Fig.  5B,  lanes 
a-c;  compare  with  Fig.  3B,  lanes  a-d).  In  contrast,  the 
degradation  of  Arg-eAK-DHFRts  at  37  °C  was  much  faster  than 
that  of  Met-eAK-DHFRts,  the  difference  in  rate  being  due  to  the 
targeting  of  Arg-eAK-DHFRts  by  both  the  N-end  rule  pathway 


37°C 


37°C  +  MTX 


X-eAK-DHFRts 


Met  Arg 

1 

D  1 - 

Met 

Arg 

1  v'  1  Met 

Arg  1 

0  60  120"  0  60  1201 

T 

60  120 

11  0  60  1201  1  0  60 

120"  0  60  1201 

abode 

t 

a 

b  c 

d  e 

f  a  b 

c  d  e  f 

Temperature 

37°C 

c  f _ 

37°C  +  MTX 

U  1 — 

Met 

Arg  1 

t  1 

Met  Arg 

Chase  (min)  1  0 

60  120**  0 

60  1201 

1  0 

60  12011  0  60 

T201 

X-eAK-DHFR  -  *** 

'**“"*•• 

a 

b 

c  d 

e  f 

a 

b  c  d  e 

f 

Fig.  5.  Effects  of  temperature  and  MTX  on  the  degradation  of  N-terminally  extended  DHFRts  and  DHFR  in  mouse  cells.  (A)  Lanes  a— c,  mouse  L-cells 
transiently  expressing  Met-eAK-DHFR,s  (Fig.  1 ,  construct  IV)  were  labeled  at  23  °C  for  10  min  with  \?>S] methionine/cysteine,  followed  by  a  chase,  also  at  23  °C, 
for  60  and  1 20  min  in  the  presence  of  cycloheximide,  extraction,  immunoprecipitation,  and  SDS/PAGE  analysis.  Lanes  d-f,  same  as  lanes  a-c  but  with  Arg-e  - 
DHFR,S.  (B)  Lanes  a-f,  same  as  lanes  a-f  in  (A),  but  the  pulse-chase  was  carried  out  at  37  °C.  (C)  Lanes  a-c,  same  as  lanes  a-c  in  (B),  but  the  pulse-chase  of 
Met-eAK-DHFR,s  was  carried  out  in  the  presence  of  20  jxm  MTX  in  the  growth  medium.  Lanes  d-f,  same  as  lanes  a-c,  but  with  Arg-eAk-DHFRls.  (D)  Lanes  a-c, 
same  as  lanes  a-c  in  (B),  but  with  Met-eAK-DHFR  (the  wild-type  DHFR  moiety).  Lanes  d-f,  same  as  lanes  a-c,  but  with  Arg-eAK-DHFR.  The  asterisk  indicates 
a  cleavage  product  derived  from  a  minority  of  short-lived  Arg-eAK-DHFR5"  and  Arg-eAK-DHFR  molecules  during  their  targeting  by  the  N-end  rule  pathway  (see 
the  main  text).  (E)  Lanes  a-c,  same  as  lane  a-c  in  (D),  but  in  the  presence  of  20  puvi  MTX.  Lanes  d-f,  same  as  lanes  a-c  but  with  Arg-eAK-DHFR. 
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and  the  dx-recognizing  pathway  (Fig.  5B,  lanes  d-f).  A  minor 
proteolytic  fragment  of  Arg-eAK-DHFRls,  visible  in  Fig.  5B-E 
(lanes  d-f),  was  specific  to  N-end  rule  substrates  bearing  an 
eAK-type  extension;  an  analogous  fragment  was  also  observed 
with  X-eK-|3  gal-based  N-end  rule  substrates  [16]. 

We  also  examined  the  metabolic  fate  of  Met-eAK-DHFR  (Ub- 
Met-eAK-DHFR)  and  Arg-eAK-DHFR  (Ub-Arg-eAK-DHFR), 
which  were  identical  with  the  substrates  above  except  that  they 
bore  the  wild-type  mouse  DHFR  moiety  (Fig.  1,  constructs  VI 
and  VII).  Met-eAK-DHFR  was  long-lived  at  both  23  °C  and 
37  °C  in  mouse  cells  (Fig.  5D,  lanes  a-c,  and  data  not  shown), 
indicating  that  the  heat-activated  dx-type  degron  of  substrates 
containing  the  DHFRls  moiety  (e.g.  Fig.  4A)  resulted  from  the 
Pro66  gr;  Leu66  mutation  that  yielded  this  moiety.  The 
degradation  of  Arg-eAK-DHFR  was  mediated  exclusively  by 
the  N-end  rule  pathway  (Fig.  5D,  lanes  d-f  vs.  lanes  a-c), 
unlike  the  double-pathway  degradation  of  Arg-eAK-DHFRts  at 
37  °C  (Fig.  5C,  lanes  d-f).  In  contrast  with  the  results  with  Arg- 
DHFRts  and  Met-DHFRls,  the  presence  of  MTX  did  not  result  in 
a  significant  stabilization  of  Arg-eAK-DHFR  (Fig.  5D  lanes  d-f 
vs.  Figure  5E  lanes  d-f). 

DISCUSSION 

We  report  the  following  results. 

(a)  Arg-DHFRts  is  long-lived  at  23  °C  but  short-lived  at  37  °C 
in  both  S .  cerevisiae  and  mouse  cells. 

(b)  In  yeast,  Arg-DHFRls  is  degraded  (at  37  °C)  exclusively 
by  the  N-end  rule  pathway,  whereas  in  mouse  cells  the  same 
protein  at  the  same  temperature  is  degraded  by  another 
proteolytic  pathway  as  well.  The  corresponding  mouse- specific 
degron  is  active  at  37  °C  but  inactive  at  23  °C.  This  degradation 
signal  is  a  currently  unknown  feature  of  the  perturbed  con¬ 
formation  of  the  DHFRls  moiety  at  37  °C,  which  differs  from  the 
wild-type  DHFR  moiety  by  a  Pro  — ►  Leu  alteration  at  position 
66. 

(c)  MTX,  a  low-A7r  ligand  of  DHFR  ( Kd  —  10  pM),  inhibits 
the  degradation  of  Arg-DHFRls  (at  37  °C)  nearly  completely  in 
yeast  and  partially  in  mouse  cells. 

(d)  The  ability  of  MTX  to  inhibit  the  degradation  of  Arg- 
DHFRls  in  vivo  is  retained  when  Arg-DHFRls  is  utilized  as  a 
portable  degron  that  confers  metabolic  instability  on  linked 
unrelated  proteins  in  yeast. 

(e)  Using  the  UPR  technique  [23]  to  determine  the  metabolic 
fate  of  Arg-DHFRts  in  mouse  cells,  we  found  that  —  36%  of  the 
labeled  Arg-DHFRts  molecules  are  degraded  during  the  10-min 
pulse,  and  that  the  presence  of  MTX  decreases  this  fraction  only 
slightly,  to  —  30%.  This  finding  accounts  for  the  leaky  inhi¬ 
bition  of  Arg-DHFRts  degradation  by  MTX  in  mouse  cells. 
Specifically,  this  finding  suggests  that  a  large  fraction  of  the 
nascent  Arg-DHFR,S  is  targeted,  in  mouse  cells,  by  the  N-end 
rule  pathway  before  the  conformation  of  the  DHFRts  moiety  is 
mature  enough  [28]  to  allow  high-affinity  binding  of  MTX.  That 
the  MTX-mediated  inhibition  of  degradation  of  Arg-DHFRts  in 
yeast  is  much  less  leaky  than  in  mouse  cells  is  but  one  difference 
in  the  detailed  properties  of  the  yeast  and  mammalian  N-end 
rule  pathways  [see  also  item  (f)]. 

(f )  When  the  N-terminus  of  Arg-DHFRts  was  extended  with  a 
42-residue  lysine-lacking  extension,  termed  eAK,  the  N-degron 
of  the  resulting  Arg-eAK-DHFRts  was  active  at  both  23  °C  and 
37  °C,  unlike  the  mouse-specific  degron,  which  remained 
inactive  at  23  °C.  Moreover,  the  degradation  of  Arg-eAK- 
DHFRls,  in  contrast  with  that  of  Arg-DHFRls,  could  not  be 
inhibited  by  MTX,  suggesting  that  the  inactivity  of  the  N-degron 
in  Arg-DHFRls  at  23  °C  results  at  least  in  part  from  inaccessibility 


of  its  N-terminal  arginine  to  the  targeting  complex  of  the  N-end 
rule  pathway.  Arg-eAK-DHFR,  which  bore  the  wild-type  DHFR 
moiety,  was  also  short-lived  in  mouse  cells.  Previously,  the 
same  protein  was  found  to  be  long-lived  in  yeast,  unless  the  eAK 
extension  was  replaced  by  an  otherwise  identical  extension, 
termed  eK,  which  contained  Lys  residues  that  functioned  as  the 
second  determinant  of  the  N-degron  [16]. 

Potential  applications  of  a  heat-inducible,  MTX-suppressible 
portable  N-degron  include  its  use  to  produce  conditional 
mutants  in  homeothermic  animals  such  as  mouse.  One  difficulty 
in  the  current  approaches  to  this  problem  [31-33]  is  that  either  a 
deletion  or  transcriptional  repression  of  a  gene  of  interest  leaves 
the  previously  produced  gene  product  unperturbed.  If  this 
protein  is  long-lived  and  resides  in  a  non-dividing  cell,  there 
may  be  a  considerable  phenotypic  lag  between  the  conditionally 
introduced  genetic  change  and  the  actual  inactivation  or  dis¬ 
appearance  of  a  corresponding  protein.  A  degron  that  can  be 
regulated  by  a  cell-penetrating  ligand  can  be  employed  to 
address  this  problem.  Since  chronic  administration  of  MTX  is 
toxic  to  mammals,  it  will  be  necessary  to  construct  an  analogous 
DHFR-based  N-degron  that  uses  E.  coll  DHFR  and  is  inhibited 
by  trimethoprim,  which  binds  tightly  to  the  E.  coli  but  not  the 
mouse  DHFR  [34].  If  a  portable  N-degron  of  this  kind  could  be 
constructed,  combining  it  with  a  conditional  repression  of  a  gene 
of  interest  should  yield  better  methods  for  producing  conditional 
mutants  in  homeothermic  animals. 

MTX,  through  its  binding  to  the  substrate  pocket  of  DHFR, 
stabilizes  DHFR  conformation  [35,36].  This  effect  was 
employed  in  several  studies  with  DHFR  as  a  reporter  protein. 
For  example,  the  binding  of  MTX  to  DHFR  can  preclude,  under 
certain  conditions,  the  translocation  of  a  DHFR-containing 
fusion  protein  across  biological  membranes  [37,38].  MTX  can 
partially  protect  DHFR  against  in  vitro  proteolysis  by  thermo - 
lysin  [38,39].  MTX  has  also  been  shown  to  block  the  degra¬ 
dation  of  a  targeted  multiubiquitylated  Arg-eK-DHFR  in 
reticulocyte  extract  [27].  In  the  latter  study,  the  N-end  rule 
pathway  in  reticulocyte  extract  was  presented  with  a  prefolded 
wild-type  DHFR  moiety.  By  contrast,  in  the  present  work  with 
intact  cells,  the  same  pathway  could  target  a  DHFR-based  N-end 
rule  substrate  immediately  after  (and  possibly  even  during)  its 
synthesis.  Further  analysis  is  required  to  clarify  the  apparent 
discrepancy  between  the  previously  observed  inhibition  of 
degradation  of  Arg-eK-DHFR  by  MTX  in  vitro  (in  reticulocyte 
extract)  [27]  and  the  absence  of  a  comparable  effect  of  MTX  in 
vivo  on  this  class  of  substrates,  in  which  N-terminal  Arg  is 
constitutively  exposed.  One  difference  between  the  two  settings 
is  the  presentation  of  prefolded  DHFR  moieties  in  reticulocyte 
extract  compared  with  a  kinetic  competition  between  the  folding 
of  newly  formed  DHFR  moieties  and  their  targeting  for 
degradation  by  the  N-end  rule  pathway. 

The  use  of  the  UPR  technique  not  only  increased  the  overall 
accuracy  of  pulse-chase  assays  with  mouse  cells  but  also 
showed  that  ~  30%  of  the  newly  made  Arg-DHFRts  was 
degraded  during  the  pulse.  The  finding  that  the  fraction  of  Arg- 
DHFRts  degraded  during  the  pulse  decreased  only  slightly  in  the 
presence  of  MTX  strongly  suggests  that  most  of  the  labeled  Arg- 
DHFRts  is  conformationally  immature  shortly  after  synthesis, 
and  therefore  cannot  form  a  high-affinity  complex  with  MTX. 
The  Pro  — ►  Leu  alteration  at  position  66  that  yielded  DHFRts 
occurred  in  the  region  of  DHFR  that  interacts  with  the  aromatic 
ring  of  MTX  [35],  and  is  likely  to  have  resulted  in  a  decreased 
affinity  of  DHFR  for  MTX.  Nonetheless,  the  test  proteins 
containing  the  DHFRts  moiety  were  efficiently  retained  on  an 
MTX  affinity  column  (data  not  shown),  and  their  in  vivo 
degradation  could  be  specifically  inhibited  by  MTX. 
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A  model  of  targeting  by  the  N-end  rule  pathway  that  accounts 
for  our  findings  with  the  heat-inducible  N-degron  assumes  that 
the  main  cause  of  metabolic  stability  of  Arg-DHFRts  at  23  °C  is 
not  the  absence  of  a  sterically  accessible  Lys  residue  within 
the  folded  DFIFR  globule,  but  rather  a  steric  inaccessibility  of 
the  N-terminal  Arg  residue.  In  this  model,  the  activation  of  the 
previously  cryptic  N-degron  of  Arg-DHFR,S  at  37  °C  is  caused 
by  an  increased  exposure  and  flexibility  of  a  region  bearing  the 
N-terminal  Arg,  a  transition  that  leads  to  the  binding  of  Arg  by 
the  targeting  complex  of  the  N-end  rule  pathway.  According  to 
this  interpretation,  MTX  stabilizes  Arg-DHFR'S  against  degra¬ 
dation  at  37  °C  not  by  precluding  the  conformational  mobili¬ 
zation  of  internal  Lys  residues,  but  by  precluding  a  temperature- 
mediated  increase  in  the  exposure  of  the  N-terminal  Arg.  In 
addition  to  being  consistent  with  the  available  evidence,  this 
model  accounts  for  the  otherwise  puzzling  result  that  Arg-eAK- 
DHFR's-ha  (which  differs  from  Arg-DHFRN  by  the  presence  of 
eAK)  is  short-lived  at  both  23  °C  and  37  °C,  whereas  Arg- 
DHFRts  is  short-lived  only  at  37  °C.  Indeed,  the  eAK  extension 
would  result  in  a  temperature-independent  enhanced  exposure  of 
N-terminal  Arg,  a  change  that  would  be,  in  this  model,  sufficient 
for  the  temperature- independent  activity  of  the  N-degron  in  Arg- 
eAK-DHFRts.  The  earlier  model  proposed  by  Dohmen  et  al .  [19], 
which  presumed  that  the  inactivity  of  the  N-degron  in  Arg- 
DHFR,S  at  23  °C  was  caused  by  the  absence  of  sterically 
accessible  Lys  residues,  cannot  account  for  the  above  result. 

How  can  we  verify  the  Arg-exposure  model?  One  way  is 
suggested  by  the  existence,  in  both  yeast  and  mammals,  of  the 
enzymes  N-terminal  amidase  (Nt-amidase)  and  Arg-tRNA- 
protein  transferase  (R-transferase),  which  chemically  modify 
specific  (sterically  exposed)  N-terminal  residues  of  proteins  in 
the  cytosol.  In  S.  cerevisiae ,  the  NTA /-encoded  Nt-amidase 
deamidates  N-terminal  Asn  or  Gin;  the  A  77E7 -encoded  R- 
transferase  conjugates  Arg  to  N-terminal  Asp  or  Glu  (reviewed 
in  [10]).  A  test  of  the  model  would  involve  the  expression  of,  for 
example,  Asn-DHFRts  (identical  to  Arg-DHFR,S  except  for  the 
presence  of  N-terminal  Asn)  in  yeast  cells  that  lack  Ubrlp,  a 
component  of  the  N-end  rule  pathway  that  recognizes  primary 
destabilizing  N-terminal  residues  such  as  Arg.  (The  absence  of 
Ubrlp  would  preclude  the  degradation  of  test  proteins.)  The 
model  predicts  that  at  23  °C  the  N-terminal  Asn  of  Asn-DHFRts 
would  be  deamidated  inefficiently  or  not  at  all,  in  contrast  with 
what  happens  at  37  °C.  This  can  be  verified  by  isolating  Asn- 
DHFR,S  from  cells  incubated  at  either  23  °C  or  37  °C,  and 
determining  its  N-terminal  residue.  If  the  model  is  correct,  the 
N-terminal  sequence  must  begin  largely  or  entirely  with  Asn  at 
23  °C,  but  would  become  Arg-Asp-...  at  37  °C.  (In  the  latter 
case,  the  N-terminal  Asp,  produced  from  Asn,  would  be 
arginylated  by  R-transferase.)  Experimental  verification  of  this 
model  is  under  way. 
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ABSTRACT  The  insufficient  selectivity  of  drugs  is  a  bane 
of  present-day  therapies.  This  problem  is  significant  for 
antibacterial  drugs,  difficult  for  antivirals,  and  utterly  un¬ 
solved  for  anticancer  drugs,  which  remain  ineffective  against 
major  cancers,  and  in  addition  cause  severe  side  effects.  The 
problem  may  be  solved  if  a  therapeutic  agent  could  have  a 
multitarget,  combinatorial  selectivity,  killing,  or  otherwise 
modifying,  a  cell  if  and  only  if  it  contains  a  predetermined  set 
of  molecular  targets  and  lacks  another  predetermined  set  of 
targets.  An  earlier  design  of  multitarget  drugs  [Varshavsky, 
A.  (1995)  Proc .  Natl.  Acad.  Sci .  USA  92,  3663-3667]  was 
confined  to  macromolecular  reagents  such  as  proteins,  with 
the  attendant  difficulties  of  intracellular  delivery  and  immu- 
nogenicity.  I  now  propose  a  solution  to  the  problem  of  drug 
selectivity  that  is  applicable  to  small  (<1  kDa)  drugs.  Two 
ideas,  codominant  interference  and  antieffectors,  should  allow 
a  therapeutic  regimen  to  possess  combinatorial  selectivity,  in 
which  the  number  of  positively  and  negatively  sensed  macro- 
molecular  targets  can  be  two,  three,  or  more.  The  nature  of  the 
effector  and  interference  moieties  in  a  multitarget  drug 
determines  its  use:  selective  killing  of  cancer  cells  or,  for 
example,  the  inhibition  of  a  neurotransmitter-inactivating 
enzyme  in  a  specific  subset  of  the  enzyme-containing  cells.  The 
in  vivo  effects  of  such  drugs  would  be  analogous  to  the 
outcomes  of  the  Boolean  operations  “and,”  “or,”  and  combi¬ 
nations  thereof.  I  discuss  the  logic  and  applications  of  the 
antieffector  and  interference/codominance  concepts,  and  the 
attendant  problem  of  pharmacokinetics. 


The  many  successes  of  pharmacology  (1,  2)  do  not  include  the 
problem  of  cancer.  Major  human  cancers  are  incurable  once 
they  have  metastasized.  A  few  relatively  rare  cancers,  such  as 
testicular  carcinoma  in  men,  Wilms’  kidney  tumor,  and  some 
leukemias  in  children,  can  often  be  cured  through  chemother¬ 
apy  but  require  cytotoxic  treatments  of  a  kind  that  cause  severe 
side  effects  and  are  themselves  carcinogenic  (2). 

The  main  reason  for  the  failure  of  cytotoxic  therapies  is  their 
insufficient  selectivity  for  tumors.  For  example,  treatments 
with  radiation  or  alkylating  agents  perturb  many  functions  that 
are  common  to  all  cells.  The  more  selective  cytotoxic  drugs,  for 
instance,  methotrexate,  taxol,  and  etoposide,  perturb  the 
functions  of  specific  macromolecular  targets  (dihydrofolate 
reductase,  microtubules,  and  topoisomerase  II),  but  these 
targets  are  present  in  both  normal  and  malignant  cells  (1,  2). 
Hence  the  low  therapeutic  index  of  anticancer  drugs  and  their 
systemic  toxicity  at  clinically  relevant  doses.  Because  the 
mitotic  activity  of  cells  in  a  tumor  is  often  lower  than  the 
mitotic  activity  of  normal  cells  in  self-renewing  tissues  such  as 
the  bone  marrow  (3),  one  might  not  have  expected  these  drugs 
to  work  at  all — to  have  any  preference  for  the  killing  of  cancer 
cells.  That  such  preference  actually  exists  stems  in  part  from 
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the  fact  that  tumor  cells  are  often  perturbed  by  their  mutations 
into  stress-hypersensitive  states.  Consequently,  these  tumor 
cells  die  an  apoptotic  death  at  the  level  of  a  drug-imposed 
metabolic  stress  that  induces  apoptosis  in  some  but  not  in  most 
of  the  organism’s  normal  cells  (3). 

With  some  cancers,  cytotoxic  therapies  are  ineffective  from  the 
beginning.  In  other  cases,  these  therapies  yield  a  partial,  some¬ 
times  clinically  complete,  but  almost  invariably  transient  remis¬ 
sion  of  a  cancer,  in  part  because  these  treatments  select  for  tumor 
cell  variants  that  retain  tumorigenicity  but  are  more  resistant  to 
either  apoptosis  per  se  or  a  drug  that  induces  apoptosis.  Because 
a  significant  increase  in  the  drug  or  radiation  dosage  is  precluded 
by  their  low  therapeutic  index,  these  therapies  become  ineffective 
when  resistant  clones  of  malignant  cells,  selected  by  a  drug 
treatment,  present  themselves  as  a  cancer  recurrence. 

The  failure  of  small  cytotoxic  drugs  to  produce  a  cure  for 
cancer  has  given  rise  to  other  strategies,  in  particular  the  insightful 
suggestion  that  solid  tumors  can  be  targeted  by  selectively  inhib¬ 
iting  neovascularization,  a  process  that  these  tumors  depend  on 
for  growing  to  a  clinically  significant  size  (4).  Another  approach, 
immunotoxins,  involves  the  linking  of  a  toxin  to  a  ligand  such  as 
an  antibody  or  a  growth  factor  that  binds  to  a  target  on  the  surface 
of  tumor  cells  (5).  Among  the  limitations  of  present-day  immu¬ 
notoxins  is  their  incapacity,  on  entering  a  cell,  to  adjust  their 
toxicity  in  response  to  the  intracellular  protein  composition.  Yet 
another  approach  is  to  enhance  the  ability  of  the  immune  system 
to  identity  and  selectively  destroy  tumor  cells.  The  current  revival 
of  this  strategy  holds  the  promise  of  a  rational  and  curative 
treatment  (6).  Given  the  complicated  regimens  and  unsolved 
problems  of  immunotherapies,  it  is  clear  that  this  and  other  recent 
approaches  (7,  8)  are  motivated  in  part  by  the  perception  that 
small-drug  pharmacology,  so  successful  against  bacterial  infec¬ 
tions,  is  unlikely  to  prove  effective  against  cancer.  In  contrast  to 
this  view,  the  premise  of  the  strategy  described  in  the  present 
work  is  that  small  anticancer  drugs  may  become  curative  and  free 
of  severe  side  effects  if  a  way  is  found  to  confer  on  these 
compounds  a  multitarget,  combinatorial  selectivity. 

Most  cancers  are  monoclonal:  cell  lineages  of  both  the  primary 
tumor  and  the  metastases  originate  from  a  single  founder  cell. 
This  cell  is  a  breakthrough  descendant  of  a  cell  lineage  that  has 
been  accumulating  mutations  for  some  time,  often  in  proximity 
to  other  neoplastic  but  still  nonmetastatic  cell  lineages  within  an 
indolent  proliferative  lesion  such  as  a  benign  tumor  (9, 10).  Given 
the  monoclonality  of  a  cancer,  cells  of  both  the  primary  tumor  and 
the  metastases  share  the  initial  mutations  that  yielded  the 
founder  cell,  even  if  these  cells  differ  at  other  loci  that  accumu¬ 
lated  mutations  in  the  course  of  the  later  tumor  progression. 
Some  of  the  early  mutations  are  in  genes  that  encode  tumor 
suppressors  (9,  11,  12).  In  most  cancers,  both  alleles  of  a  tumor 
suppressor  gene  are  inactivated,  sometimes  through  deletions 
that  encompass  the  gene  on  the  two  homologous  chromosomes. 
Thus,  a  monoclonal  cancer,  although  heterogeneous  genetically, 


Abbreviation:  IC,  interference/codominance. 
tTo  whom  reprint  requests  should  be  addressed  at:  Division  of 
Biology,  147-75,  Caltech,  1200  East  California  Boulevard,  Pasadena, 
CA  91125.  e-mail:  avarsh@cco.caltech.edu. 


2094 


Biochemistry:  Varshavsky 


Proc.  Natl  Acad.  Sci.  USA  95  (1998)  2095 


always  contains  a  set  of  founder  mutations  that  is  shared  by  all  of 
its  cells. 

A  drug  that  kills  a  cell  only  if  it  lacks  a  specific  macromo¬ 
lecular  target  would  distinguish  tumor  cells  from  many  other 
cells  of  an  organism,  provided  that  the  target  is  a  product  of 
a  gene  that  had  been  deleted  or  inactivated  in  this  cancer  at  the 
stage  of  its  founder  cell.  Such  a  drug  may  be  especially  selective 
against  cancers  that  lack  a  gene  for  a  ubiquitously  expressed 
tumor  suppressor,  for  example,  the  retinoblastoma  (Rb)  pro¬ 
tein  (11, 12).  An  example  of  the  negative-target  approach  is  the 
use  of  a  mutant  adenovirus  that  replicates  selectively  in  human 
cancer  cells  lacking  the  tumor  suppressor  p53  and  has  been 
shown  to  kill  these  cells  in  a  model  setting  (8). 

However,  other  tumor  suppressors  may  not  be  expressed  at 
comparable  levels  in  most  cells.  A  drug  that  kills  a  cell  if  it  lacks 
a  nonubiquitous  tumor  suppressor  would  be  toxic  to  a  subset 
of  normal  cells  as  well.  This  problem  could  be  reduced  through 
the  use  of  a  drug  that  is  toxic  only  if  a  cell  lacks  two  specific 
macromolecules,  termed  negative  targets.  Two  judiciously 
chosen  negative  targets  may,  together,  suffice  to  distinguish  all 
of  the  cancer  cells  from  all  of  the  organism’s  normal  cells.  If 
they  do  not,  a  third  negative  target  that  had  been  deleted  or 
rendered  defective  in  a  given  cancer  can  be  employed  as  well. 
This  strategy  requires  a  drug  that  possesses  the  ability  to  kill 
a  cell  if  it  lacks  two  or  more  of  the  predetermined  targets,  but 
would  spare  a  cell  containing  either  one  of  these  targets. 

Other  changes  in  a  founder  cell  may  involve  a  missense 
mutation,  an  amplification  and  overexpression,  an  ectopic 
expression,  or  a  translocation/fusion  of  a  specific  protoonco¬ 
gene  such  as,  for  example,  Ras  or  Myc  (9,  10,  13).  A  single 
oncoprotein  may  not  be  a  unique  enough  target  by  itself,  for 
reasons  similar  to  those  described  above  in  the  context  of 
negative  targets.  However,  a  combination  of  two  or  more 
distinct  oncoproteins  that  were  either  mutated  or  inappropri¬ 
ately  expressed  in  the  founder  cell  can  be  employed  to 
formulate  the  unique  multiprotein  signature  of  a  specific 
cancer  that  comprises  both  positive  and  negative  targets. 

These  considerations  suggest  that  a  conditionally  cytotoxic 
therapeutic  regimen  that  is  exquisitely  specific  for  a  given 
cancer,  and  therefore  would  eliminate  it  without  significant 
side  effects,  must  possess,  in  most  cases,  a  multitarget,  com¬ 
binatorial  (positive/negative)  selectivity  of  the  kind  defined 
above.  Conversely,  even  an  informed  choice  of  the  molecular 
target  for  a  single-target  drug  may  not  suffice  to  define 
unambiguously  the  cell  type  to  be  eliminated.  Note  that  simply 
combining  two  single-target  drugs  against  two  different  targets 
in  a  multidrug  regimen  would  not  yield  a  multitarget  selectiv¬ 
ity,  because  the  two  drugs  together  would  perturb  not  only  cells 
containing  both  targets  but  also  cells  containing  either  one  of 
the  targets. 

Although  the  problem  of  insufficient  selectivity  is  not  as 
acute  with  noncytotoxic  drugs,  it  is  relevant  to  them  as  well. 
Among  the  multitude  of  examples  are  side  effects  of  therapies 
with  antipsychotic  agents.  The  side  effects  are  caused  in  part 
by  the  insufficient  molecular  specificity  of  drugs,  which  is 
exemplified  by  the  ability  of  antidepressants  that  inhibit  mono¬ 
amine  oxidase  to  perturb  other  proteins  as  well  (2).  This 
difficulty  will  continue  to  abate  with  the  development  of  more 
specific  single-target  inhibitors.  But  an  entirely  distinct,  major, 
and  unsolved  problem  with  inhibitors  as  drugs  is  the  current 
impossibility  of  restricting  their  action  to  a  specific  subset  of 
cells  among  those  that  contain  the  inhibitor’s  target.  For 
example,  even  an  exquisitely  specific  inhibitor  of  a  clinically 
relevant  enzyme  is  likely  to  have  side  effects,  because  the 
target  enzyme  is  present,  in  most  cases,  not  only  in  the  cells 
where  its  inhibition  is  clinically  beneficial  but  also  in  the  cells 
where  its  inhibition  is  physiologically  inappropriate.  The 
present  work  describes  a  possible  solution  of  this  problem. 

A  previously  proposed  approach  to  designing  multitarget  drugs 
utilized  degradation  signals  (degrons)  and  analogous  signals  that 


exhibit  the  property  of  codominance  (14,  15).  As  a  result,  this 
strategy  was  confined  to  macromolecular  reagents  such  as  pro¬ 
teins,  with  the  attendant  problems  of  immunogenicity,  extrava¬ 
sation,  and  intracellular  delivery.  The  latter  difficulty  is  especially 
significant,  because  either  gene-therapy  or  direct-delivery  meth¬ 
ods  for  introducing  large  molecules  into  cells  work  reasonably 
well  with  cells  in  culture  but  are  still  inefficient  with  cells  in  an 
intact  organism.  The  challenge,  then,  is  to  attain  a  multitarget, 
combinatorial  selectivity  in  the  setting  of  small  (^1  kDa)  drugs, 
where  the  immunogenicity  and  delivery  problems  are  less  severe. 

A  solution,  described  below,  invokes  a  modification  of  the  earlier 
idea  of  codominant  interference  (14)  in  conjunction  with  the  new 
concept  of  antieffectors.  This  solution  is  applicable  to  either 
cytotoxic  or  noncytotoxic  therapies. 

RESULTS  AND  DISCUSSION 

Multitarget  Compounds  Specific  for  Negative  Targets:  The 
Concept  of  Codominant  Interference.  Previous  work  (14) 
suggested  that  the  property  of  codominance,  characteristic  of 
degradation  signals  (degrons)  and  many  other  signals  in 
biopolymers,  can  be  employed  to  design  protein-based  re¬ 
agents  that  possess  multitarget,  combinatorial  selectivity  of  the 
kind  defined  above.  Codominance  refers  to  the  ability  of  two 
or  more  signals  in  the  same  molecule  to  function  indepen¬ 
dently  and  not  to  interfere  with  each  other.  It  is  shown  below 
that  a  distinct  version  of  the  interference/codominance  (IC) 
concept  (14)  is  applicable  to  small  (<1  kDa)  compounds. 
Consider  a  reagent  containing  three  small  moieties  a,  b,  and 
i,  which  can  bind,  respectively,  to  three  macromolecular  targets 
A,  B,  and  I.  Because  the  moieties  a,  b,  and  i  are  much  smaller 
than  the  macromolecules  A,  B,  and  I,  it  should  be  possible  to 
arrange  these  moieties  in  the  compound  abi  in  such  a  way  that 
the  binding  of  A  or  B  to  a  or  b  would  preclude,  through  steric 
hindrance,  the  binding  of  moiety  i  to  I  (Fig.  1). 

That  the  interactions  of  a  small  bipartite  compound  with  its 
two  macromolecular  ligands  can  be  made  mutually  exclusive  is 
expected  from  basic  physicochemical  considerations.  This  has 
also  been  demonstrated  directly,  in  a  context  unrelated  to  the 
present  discussion.  When  lisinopril,  an  inhibitor  of  the  angio¬ 
tensin-converting  enzyme  (ACE),  was  connected,  via  an  11- 
atom  linker,  to  the  biotin  moiety,  the  resulting  bivalent  com¬ 
pound  could  bind  to  and  inhibit  ACE  in  the  absence  but  not 
in  the  presence  of  the  biotin-binding  protein  streptavidin  (16). 
Small  compounds  comprising  two  linker-connected  moieties 
such  as  cyclosporin  and  FK506,  which  are  specific  for  two 
macromolecular  targets,  have  previously  been  employed  as  in 
vivo  dimerization  devices,  making  it  possible  to  bring  together 
two  otherwise  non  interacting  proteins  (17).  However,  the 
linker  moiety  of  these  bipartite  compounds  was  chosen  to  allow 
simultaneous  interactions  with  the  targets,  in  contrast  to  the 
mutual  exclusivity  of  interactions  in  the  IC  approach  (Fig.  1). 

If  the  moiety  i  is  an  inhibitor  of  an  essential  cellular  enzyme  I, 
the  presence  of  the  macromolecular  targets  A  or  B  in  a  cell  would 
reduce  the  inhibition  of  enzyme  I  by  abi,  because  the  complexes 
abi-A  and  abi-B  would  be  mutually  exclusive  with  the  complex 
abi-I  (Fig.  1).  Note  that  A  and  B  are  codominant  in  their  ability 
to  reduce  the  inhibition  of  I  by  abi.  Therefore,  there  is,  formally, 
no  limit  on  the  number  of  a,  b-like  competition  modules  that  can 
be  used  to  construct  an  abi-like  compound  whose  activity  is 
sensitive  to  the  presence  of  several  distinct  macromolecules, 
called  negative  targets.  The  fractional  occupancy  of  the  macro¬ 
molecular  targets  A,  B,  and  I  by  the  a,  b,  and  i  moieties  of  abi 
would  be  determined  in  part  by  the  targets’  intracellular  concen¬ 
trations.  There  are  also  specific  pharmacokinetic  constraints  on 
the  selectivity  of  abi,  an  issue  discussed  below. 

A  tabulation  of  the  relative  toxicities  of  abi  for  cells  that  either 
lack  or  contain  the  negative  targets  A  and  B  is  shown  in  Fig.  2.  It 
can  be  seen  that  abi  would  be  relatively  nontoxic  to  three  of  the 
four  cell  types  and  toxic  exclusively  to  the  cells  that  lack  both  A 
and  B  (Fig.  2).  Thus,  the  IC  concept  allows  the  construction  of 
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Fig.  1.  The  interference/codomi¬ 
nance  concept.  (1-4)  A  small  (<1 
kDa)  moiety  i  is  linked  to  two  other 
small  moieties,  a  and  b.  The  moieties 
a,  b,  and  i  are  ligands  of  the  macro¬ 
molecules  A,  B,  and  I,  respectively. 
The  distances  between  a,  b,  and  i,  and 
their  mutual  arrangement  in  the  tri¬ 
partite  compound  abi  are  such  that 
the  interaction  i-I  is  mutually  exclu¬ 
sive  with  either  the  interaction  a-A  or 
the  interaction  b-B.  Specifically,  the  macromolecule  I  in  its  complex  with  the  small  moiety  i  would  sterically  clash  with  the  macromolecules  A  or 
B  if  either  A  or  B  is  positioned  to  bind  a  or  b  of  abi  (5  and  6).  In  the  diagram,  the  interactions  a-A  and  b-B  are  also  mutually  exclusive,  but  this 
constraint  is  not  essential.  (Note  that  if  the  interactions  a-A  and  b-B  were  mutually  nonexclusive,  the  compound  abi  would  promote  the  binding 
of  A  to  B.)  The  codominance  aspect  of  the  IC  concept  allows  this  design  to  accommodate  more  than  two  of  the  a,  b-like  competition  modules  (not 
shown).  In  Figs.  2-4,  the  i  moiety  is  an  inhibitor  of  an  essential  enzyme  I.  In  fact,  the  only  constraint  on  the  identities  of  i  and  I  is  the  requirement 
for  an  i-I  interaction  to  alter  the  functional  activity  of  a  macromolecule  I.  In  other  words,  the  choice  of  I  is  determined  by  the  intended  effect  of 
the  (unsequestered)  compound  abi  (see  the  main  text). 


small  compounds  that  exhibit  multitarget  selectivity  for  negative 
targets.  One  more  idea  is  required  to  accomplish  the  same  for 
positive  targets  and  to  link  the  two  strategies. 

Multitarget  Compounds  Specific  for  Positive  Targets:  The 
Concept  of  Antieffectors.  Consider  a  small  compound  i*  that 
binds  to  enzyme  I  in  the  vicinity  of  its  active  site,  but  does  not 
perturb  the  catalytic  activity  of  I  toward  its  physiological 
substrates  (Fig.  3 B).  Suppose  further  that  the  compound  i*, 
termed  an  antiinhibitor,  was  designed  to  interfere,  sterically, 
with  the  binding  of  an  inhibitor  i  to  the  enzyme’s  active  site 
while  at  the  same  time  allowing  the  binding  of  physiological 
substrates.  One  way  to  achieve  this  would  be  to  endow  either 
i,  or  i*,  or  both  of  them  with  a  set  of  chemical  groups,  termed 
a  “bump,”  whose  function  is  to  produce  steric  hindrance  that 
makes  the  interactions  i-I  and  i*-I  mutually  exclusive  (Fig.  3^4 
and  B ).  A  moiety  that  functions  as  a  bump  may  also  be 
designed  to  enhance  specific  binding  of  either  the  inhibitor  i  or 
the  antiinhibitor  i*  to  enzyme  I,  but  this  consideration  is 
secondary  to  the  bump’s  essential  purpose. 

In  one  application  of  the  antiinhibitor  i*,  it  is  linked  to  c,  a 
small  moiety  that  can  bind  to  a  macromolecular  target  C.  The 
mutual  arrangement  of  i*  and  c  in  ci*  is  such  that  the 
interactions  of  ci*  with  I  and  C  are  mutually  exclusive.  In  the 
absence  of  C,  ci*  would  compete  with  abi  for  the  binding  to 


enzyme  I,  thereby  partially  protecting  I  from  inhibition  by  abi 
(Fig.  3 C).  This  protective  effect  of  ci*  would  be  suppressed  in 
the  presence  of  its  macromolecular  target  C  (Fig.  3D).  In  the 
logic  of  codominance,  discussed  above  in  the  context  of 
negative  targets,  a  compound  bearing  an  antiinhibitor  moiety 
i*  could  contain  more  than  one  c-like  moiety.  For  example,  a 
compound  cdi*,  whose  moieties  c  and  d  can  bind,  respectively, 
to  the  macromolecules  C  and  D,  would  reduce  the  inhibition 
of  enzyme  I  by  abi  only  in  the  absence  of  both  C  and  D.  Yet 
another  pattern  of  multitarget  selectivity  can  be  produced,  in 
this  context,  by  separating  the  competition  moieties  c  and  d. 
The  resulting  ci*  and  di*,  if  administered  together  with  abi, 
would  reduce  the  inhibition  of  enzyme  I  by  abi  if  just  one  of 
the  targets,  C  or  D,  is  absent.  As  shown  below,  the  key  merit 
of  the  antiinhibitor  idea  is  that  it  allows  the  effect  of  a  single 
inhibitor  i  to  be  modulated  by  both  negative  and  positive 
macromolecular  targets. 

On  the  Difference  Between  Antieffectors  and  Antagonists. 

The  distinctions  between  substrates,  inhibitors,  and  antiinhibi¬ 
tors  were  described  above.  The  concept  of  antieffectors  is  also 
relevant  to  ligand-binding  biopolymers  other  than  enzymes. 
For  example,  an  agonist  binds  to  its  receptor  and  evokes  a 
physiological  response.  An  antagonist  binds  to  a  site  of  the 
receptor  that  overlaps  with  the  agonist-binding  site,  does  not 
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Fig.  2.  Multitarget  selectivity  of  a 
compound  that  utilizes  interference/ 
codominance.  This  diagram  tabulates 
the  relative  toxicities  of  the  compound 
abi  for  cells  that  either  lack  or  contain 
macromolecular  targets  A  and  B.  The  i 
moiety  of  the  compound  abi  (see  the 
legend  to  Fig.  1)  inhibits  an  essential 
enzyme  I.  The  interaction  i-I  is  mutu¬ 
ally  exclusive  with  the  interaction  a-A 
and  the  interaction  b-B,  the  macromol¬ 
ecules  A  and  B  being  negative  targets  of 
abi.  It  is  assumed  that  concentrations 
of  the  targets  A  and  B  in  cells  that 
contain  at  least  one  of  them  signifi¬ 
cantly  exceed  the  concentration  of  I 
(see  the  main  text).  In  A+  B+,  A+  B“, 
and  A"  B+  cells,  the  enzyme  I  would 
be  at  most  partially  inhibited  by  the  i 
moiety  of  abi,  because  of  the  compet¬ 
ing  interactions  of  abi  with  A  and/or  B. 
By  contrast,  in  A-  B_  cells,  the  bulk  of 
abi  molecules  would  be  available  for 
interaction  with  I,  resulting  in  the  se¬ 
lective  toxicity  of  abi  to  these  cells.  The 
selectivity  pattern  of  abi  requires  that 
certain  pharmacokinetic  conditions 
are  met  as  well  (see  the  main  text). 
Note  that  the  physiological  effects  and 
the  uses  of  abi-type  compounds  are  not 
confined  to  cytotoxic  regimens. 
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Fig.  3.  The  antieffector  concept.  The  particular  case  illustrated 
here  is  that  of  an  antiinhibitor  i*,  defined  as  a  compound  whose 
binding  to  an  enzyme  I  does  not  inhibit  the  activity  of  I  but  does 
preclude  the  inhibition  of  I  by  an  inhibitor  i.  In  this  example,  the 
antiinhibitor  i*  has  the  following  properties.  First,  it  binds  to  I  in  the 
vicinity  of  the  I’s  active  site,  but  does  not  perturb  the  catalytic  activity 
of  I  toward  its  physiological  substrates.  Second,  i*  in  its  bound  state, 
sterically  interferes  with  the  interaction  between  I  and  its  inhibitor  i. 
To  implement  the  second  condition,  either  i,  or  i*,  or  both  of  them 
bear  additional  moiety,  a  “bump,”  denoted  by  the  rectangular  pro¬ 
trusions  in  i  and  i*.  The  function  of  the  bump  is  to  produce  steric 
hindrance  that  makes  the  interactions  i-I  and  i*-I  mutually  exclusive. 
The  inhibitor  i  described  here  and  in  the  main  text  is  a  competitive 
inhibitor,  but  i  could  be  a  noncompetitive  inhibitor  as  well.  An 
allosteric  antiinhibitor,  which  functions  through  binding  to  a  remote 
site  of  enzyme  I,  is  yet  another  possibility.  (A)  A  complex  of  the 
enzyme  I  with  its  inhibitor  i.  (B)  A  complex  of  I  with  its  antiinhibitor 
i*.  Note  that  the  bumps  of  the  bound  i  and  i*  spatially  overlap.  (C)  The 
antiinhibitor  i*  is  linked  to  c,  a  small  moiety  that  can  bind  to  a 
macromolecular  target  C.  The  design  of  ci*  is  analogous  to  abi  (Figs. 
1  and  2),  in  that  the  interactions  of  ci*  with  I  and  C  are  mutually 
exclusive.  When  ci*  is  bound  to  I,  the  inhibitor  i,  shown  here  as  a  part 
of  the  compound  abi  (Figs.  1  and  2),  is  unable  to  bind  to  and  inhibit 
the  enzyme  I.  ( D )  A  complex  between  ci*  and  its  macromolecular 
target  C.  This  complex,  being  mutually  exclusive  with  the  ci*-I 
complex,  reduces  the  ability  of  ci*  to  protect  the  enzyme  I  from 
inhibition  by  abi. 

activate  the  receptor,  and  in  addition  precludes  the  binding  of 
agonist  (1,  2).  By  contrast,  an  antieffector,  which  would  be 
called,  in  this  setting,  an  antiantagonist,  binds  to  the  receptor 
in  such  a  way  that  the  receptor  can  still  bind,  and  respond  to, 
the  agonist,  but  cannot  bind  the  antagonist.  To  this  end,  either 
an  antagonist,  or  an  antiantagonist,  or  both  must  possess  a 
bump,  an  additional  moiety  described  above  in  the  context  of 


enzymes  and  antiinhibitors  (Fig.  3).  The  idea  of  antieffectors 
is  thus  distinct  from  that  of  antagonists  or  inhibitors  and  is  new, 
to  the  best  of  my  knowledge.  I  am  also  not  aware  of  a  naturally 
occurring  pair  of  compounds  that  satisfy  the  definition  of 
effectors/antieffectors  in  a  physiologically  relevant  setting. 

Inter ference/Codominance  and  Antieffector  in  a  Regimen 
That  Possesses  Combinatorial  Selectivity.  Applying  the  IC  and 
antieffector  concepts  together  yields  regimens  that  possess  true 
combinatorial  selectivity,  i.e.,  sensitivity  to  both  negative  and 
positive  targets.  Consider  a  population  of  cells  that  either  contain 
or  lack  the  macromolecular  targets  A,  B,  and  C.  Our  aim  is  to 
devise  a  treatment  that  would  be  toxic  to  cells  that  lack  A  and  B 
but  contain  C  (A"  B~  C+  cells)  and  relatively  nontoxic  to  the 
other  cell  types  (Fig.  4).  A  regimen  of  two  compounds,  abi  (Fig. 
3C)  and  ci*  (Fig.  3D),  has  the  requisite  selectivity,  as  shown  in  Fig. 
4,  which  tabulates  the  outcomes  of  this  treatment  for  different  cell 
types.  Specifically,  in  the  absence  of  C  (four  cell  types  out  of 
eight),  the  antiinhibitor-containing  ci*  would  compete  with  the 
inhibitor-containing  abi  for  binding  to  the  essential  enzyme  I, 
thereby  reducing  the  inhibition  of  I  by  abi,  and  hence  reducing  the 
toxicity  of  abi.  In  three  other  cell  types,  whose  common  property 
is  the  presence  of  C  and  at  least  one  of  the  other  two  targets,  A 
or  B,  the  antiinhibitor-containing  ci*  would  be  largely  seques¬ 
tered  by  C,  and  hence  inactive,  but  the  inhibitor-containing  abi 
would  be  sequestered  as  well,  by  either  A  or  B.  In  only  one  type 
of  cells,  those  that  lack  A  and  B  but  contain  C  (A"  B~  C+  cells), 
is  the  inhibitor  abi  fully  available  for  interaction  with  I,  resulting 
in  higher  toxicity  (Fig.  4).  The  differences  in  the  toxicity  of  abi  to 
different  cell  types  would  be  determined  by  the  relative  stoichio¬ 
metries  and  absolute  concentrations  of  the  cellular  targets  in¬ 
volved  (A,  B,  C,  and  I),  by  the  affinities  of  the  moieties  a,  b,  c,  i, 
and  i*  for  these  targets,  and  by  the  pharmacokinetic  properties  of 
abi  and  ci*. 

Straightforward  variations  of  the  abi  and  ci*  designs  that 
utilize  IC  and  the  properties  of  antiinhibitors  would  allow 
selective  targeting  of  any  one  of  the  eight  cell  types  that  differ 
by  the  presence  or  absence  of  three  macromolecular  targets. 
Moreover,  there  is  no  formal  limit  on  the  total  number  of 
negative  and/or  positive  targets  that  can  be  simultaneously 
sensed  by  regimens  that  employ  abi-  and  ci*-type  compounds 
bearing  multiple  interference  moieties.  Note  that  the  cell  type 
selectivity  of  regimens  such  as  abi  +  ci*  (Fig.  4)  is  analogous 
to  the  outcomes  of  the  Boolean  operations  “and,”  “or,”  and 
combinations  thereof. 

Stoichiometries,  Affinities,  and  Pharmacokinetics.  The  se¬ 
lectivity  of  the  proposed  compounds  results  from  mutually 
exclusive,  competing  interactions  between  individual  moieties 
of  these  compounds  and  their  macromolecular  targets  (Figs. 
2-4);  hence,  the  importance  of  the  targets’  intracellular  con¬ 
centrations,  relative  to  each  other  and  the  enzyme  I,  which  is 
inhibited  by  an  effector  moiety  of  these  drugs.  The  choice  of 
I  is  not  confined  to  essential  enzymes.  The  target  I  could  be, 
for  instance,  a  DNA-binding  repressor  of  terminal  differenti¬ 
ation,  a  repressor  of  apoptosis,  or,  in  the  example  of  a 
noncytotoxic  therapy,  a  neurotransmitter-inactivating  enzyme. 
In  other  words,  the  choice  of  I  is  determined  by  the  intended 
effect  of  the  (unsequestered)  compound  abi. 

The  sequestration  of  abi-  and  ei*-type  compounds  by  their 
macromolecular  ligands  A,  B,  and  C  serves  to  prevent  their 
binding  to  the  enzyme  I  (Figs.  3  and  4).  Therefore,  in  schemes 
of  the  type  considered  above,  the  molar  concentration  of  I 
should  be  significantly  (if  possible,  considerably)  lower  than 
the  molar  concentrations  of  A,  B,  and  C.  In  addition,  the 
concentration  of  ci*  in  a  ci*  +  abi  regimen  should  significantly 
exceed  that  of  abi,  because  ci*  is  the  sole  obstacle  to  the 
inhibition  of  enzyme  I  by  abi  in  the  A”  B“  C“  cells  (Fig.  4). 
It  is  assumed,  furthermore,  that  the  total  intracellular  con¬ 
centrations  of  abi  and  ci*,  bound  and  unbound,  would  remain 
significantly  below  the  concentrations  of  the  interference 
targets  A,  B,  and  C.  The  affinities  of  A,  B,  C,  and  I  for  the 
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Fig.  4.  Combinatorial  (positive/ 
negative)  selectivity  of  a  regimen  that 
utilizes  interference/codominance 
and  antiinhibitor.  The  diagram  tabu¬ 
lates  the  relative  toxicities  of  the  com¬ 
pound  abi  in  the  setting  of  a  two- 
compound  treatment  of  cells  that  ei¬ 
ther  lack  or  contain  the 
macromolecular  targets  A,  B,  and  C. 
The  inhibitor-containing  compound 
abi  is  described  in  the  main  text  and 
Figs.  1,  2,  and  3 C.  The  antiinhibitor- 
containing  compound  ci*  is  described 
in  the  main  text  and  Fig.  3.  A  and  B  are 
negative  targets,  in  that  they  reduce, 
through  the  binding  to  the  moieties  a 
and  b  of  abi,  the  inhibition  of  an 
essential  enzyme  I  by  abi.  C  is  a 
positive  target,  in  that  it  reduces, 
through  the  binding  to  the  c  moiety  of 
ci*,  the  binding  of  ci*  to  enzyme  I. 
This  results  in  a  larger  fraction  of 
enzyme  I  available  for  the  inhibition 
by  abi.  It  is  assumed  that  the  concen¬ 
trations  of  A,  B,  and  C  in  cells  that 
contain  them  significantly  exceed  the 
concentration  of  I  (see  the  main  text). 
In  all  of  the  cell  types  except  A~  B 
C+,  the  enzyme  I  would  be  at  most 
partially  inhibited  by  the  i  moiety  of 
abi,  because  of  the  competing  inter¬ 
actions  of  abi  with  A  and/or  B,  and 
also  because  in  C-  cells  a  fraction  of 
enzyme  I  would  be  protected  from  the 
inhibition  by  abi  through  the  interac¬ 
tion  of  I  with  the  antiinhibitor  moiety 
I*  of  ci*.  By  contrast,  in  A-  B~  C+ 
cells,  the  antiinhibitor-containing  ci* 
would  be  sequestered  by  C,  whereas 
abi  would  not  be  sequestered  by  A  or 
B,  which  are  absent  from  these  cells. 
As  a  result,  a  larger  fraction  of  the  inhibitor-containing  abi  molecules  would  be  available  for  the  interaction  with  I,  resulting  in  the  selective  toxicity 
of  abi  to  A~  B~  C+  cells.  The  selectivity  pattern  of  abi  requires  that  certain  pharmacokinetic  conditions  are  met  as  well  (see  the  main  text).  Note 
that  the  physiological  effects  and  the  uses  of  abi-type  compounds  are  not  confined  to  cytotoxic  regimens. 
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respective  moieties  of  the  compounds  abi  and  ci*,  and  also  the 
targets’  intracellular  locations  are  among  the  independent 
parameters  that  can  be  varied  in  designing  these  compounds. 

Yet  another,  and  major,  constraint  on  the  nature  and 
pharmacokinetics  of  multitarget  drugs  stems  from  the  fact  that 
the  selectivity  patterns  described  above  (Figs.  2  and  4)  may  not 
be  observed  under  equilibrium  conditions,  where  the  influx  of 
a  drug  into  cells  equals  its  outflux.  To  illustrate  one  clear 
difficulty,  let  us  oversimplify  and  suppose  that  an  abi-type 
compound  is  metabolically  inert,  in  addition  to  being  capable 
of  crossing  the  plasma  membranes  and  other  lipid  bilayers.  If 
abi  is  initially  outside  the  cells,  and  if  the  extracellular  pool  of 
abi  (e.g.,  in  the  blood  plasma)  is  large  enough,  it  can  be  shown 
that  the  subsequently  reached  equilibrium  state  would  be 
characterized  by  equal  concentrations  of  the  free  abi  in  both 
the  A+  B+  and  A-  B“  cells,  thereby  resulting  in  the  equal 
occupancies  of  the  enzyme  I  by  abi  in  these  cells,  contrary  to 
the  pattern  illustrated  in  Fig.  2. 

By  contrast,  the  selectivity  patterns  of  Figs.  2  and  4  would  be 
observed  during  the  initial  influx  of  drugs  into  cells.  Thus,  one 
requirement  for  the  multitarget  selectivity  of  abi-  and  ci*-type 
regimens  is  the  avoidance  of  equilibrium  states  such  as  the  one 
described  above.  This  and  related  considerations  indicate  that 
despite  the  logical  simplicity  of  the  proposed  designs,  their 
implementation  will  have  to  address  pharmacokinetic  prob¬ 
lems  that  do  not  necessarily  arise  with  single-target  drugs. 

Selection  of  Targets  and  Construction  of  Multitarget  Drugs. 
Appropriate  macromolecular  targets  of  the  A-C  class  (Figs.  3  and 
4)  are  suggested  by  the  protein  composition  of  the  tumor  cells  to 


be  eliminated.  The  choice  of  an  essential  intracellular  enzyme  I 
(Figs.  2-4)  is  determined  by  the  presence  of  I  at  least  in  tumor  cells, 
its  physiological  concentration,  and  the  feasibility  of  an  efficacious 
inhibitor  of  I.  Among  potentially  suitable  enzymes  for  which 
cell-penetrating  inhibitors  already  exist  is  dihydrofolate  reductase. 
Its  high-affinity  inhibitors  include  methotrexate,  which  enters  cells 
through  carrier-mediated  pathways,  and  the  more  lipophilic  tri- 
metrexate,  which  can  enter  cells  by  diffusing  through  lipid  bilayers 
(18). 

Each  of  the  modules  in  the  abi-  and  ci*-type  compounds  (Figs. 
1-4)  would  bind  its  macromolecular  target  in  the  absence  of  the 
other  modules.  Therefore,  the  cytotoxic  i-type  modules  of  IC- 
based  compounds  would  be  similar  to  the  stand-alone  cytotoxic 
drugs  of  today.  By  contrast,  the  interference  modules  of  these 
compounds,  i.e.,  their  a-,  b-,  and  c-type  moieties  (Figs.  1-3),  are 
supposed  to  bind  to  their  macromolecular  targets  but  preferably 
not  impair  them  functionally.  This  specification  of  a  competition 
module  simplifies  its  design  in  comparison  to  that  of  inhibitors, 
because  many  sites  on  the  target’s  surface,  and  not  just  the  active 
site,  would  be  acceptable. 

Antiinhibitors  (Fig.  3)  are  a  new  class  of  physiologically  active 
compounds.  The  opportunities  and  problems  of  their  design  are 
similar  to  those  for  the  interference  moieties  a-c  (Figs.  1-3),  but 
there  are  two  other  difficulties  as  well.  First,  the  antiinhibitor 
moiety  i*  must  bind  in  the  vicinity  of,  but  not  at,  the  active  site 
of  enzyme  I.  Second,  the  moiety  i*  must  also  bear  chemical 
groups  (a  bump)  whose  function  is  to  preclude,  through  steric 
hindrance,  the  binding  of  the  inhibitor  moiety  i  to  the  enzyme  I 
(Fig.  3). 
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A  substrate-binding  cleft  is  not  the  only  indentation  in  a 
folded  protein  molecule.  The  other  clefts  tend  to  be  smaller, 
but  they  are  present  as  well  (19),  and  some  of  them  may  be 
located  next  to  the  enzyme’s  active  site  (Fig.  3).  In  addition, 
even  relatively  flat  molecular  surfaces  can  be,  in  principle,  the 
sites  of  high-affinity  interactions  with  small  ligands  (20).  These 
optimistic  comments  notwithstanding,  the  development  of 
antiinhibitors  is  certain  to  be  a  complex  undertaking.  As 
discussed  above,  the  pharmacokinetic  aspects  of  the  proposed 
designs  are  also  complex.  Yet  simplicity  is  good  only  if  it  works. 
Single-target  anticancer  drugs  remain  unsatisfactory,  in  spite 
of  decades  of  immense  effort.  It  may  therefore  be  wise  to 
attempt  a  more  complex  but  also  more  effective  solution. 

A  recent  advance  in  drug  design,  termed  SAR  by  NMR 
(structure-activity  relationships  by  nuclear  magnetic  reso¬ 
nance),  provides  an  especially  promising  route  to  constructing 
ligands  for  specific  regions  of  a  protein  molecule  (21).  In  this 
approach,  a  library  of  small  molecules  is  screened  for  binding 
to  an  15N-labeled  protein  by  using  NMR,  which  can  detect 
weak  interactions,  and  in  addition  assigns  them  to  specific 
nitrogens  of  a  protein,  thereby  identifying  the  site  of  binding. 
Finding  two  small  compounds  that  bind  to  adjacent  patches  of 
the  target  protein  molecule  and  covalently  linking  these  com¬ 
pounds  produces  a  higher-affinity  ligand.  This  powerful  strat¬ 
egy  (21),  which  already  yielded  tightly  binding  ligands  of 
specific  proteins,  may  prove  sufficient  for  constructing  the  a-c 
competition  modules  and  the  i*  antiinhibitor  modules  of  the 
proposed  designs  (Figs.  1-4). 

Noncytotoxic  Multitarget  Drugs.  Many  useful  drugs  are  the 
inhibitors  of  intracellular  enzymes  that  are  not  essential  for  cell 
viability  (2).  The  problem  of  insufficient  selectivity  is  relevant 
to  these  drugs  as  well.  For  example,  even  an  exquisitely  specific 
inhibitor  of  a  clinically  relevant  enzyme  is  likely  to  have 
significant  side  effects,  because  the  target  enzyme  is  present, 
in  most  cases,  not  only  in  the  cells  where  its  inhibition  is 
clinically  beneficial  but  also  in  the  cells  where  its  inhibition  is 
physiologically  inappropriate.  The  logic  of  abi-type  inhibitors 
(Figs.  1  and  2)  and  cP-type  antiinhibitors  (Figs.  3  and  4)  is 
applicable  in  these  settings,  because  an  informed  choice  of  the 
competition  moieties  a,  b,  and  c  would  sharpen  up  the  cell 
selectivity  of  the  inhibitor  moiety  i  in  the  way  described  above 
for  cytotoxic  drugs  (Fig.  4),  resulting  in  the  inhibition  of  the 
(nonessential)  enzyme  I  in  a  predetermined  subset  of  the 
enzyme-containing  cells.  Note  that  the  same  considerations 
apply  to  extracellular  settings  as  well.  The  examples  above  are 
but  a  glimpse  of  the  drug-engineering  vistas  that  are  opened 
up  by  the  IC  and  antieffector  concepts.  At  the  same  time,  there 
are  significant  pharmacokinetic  constraints  on  the  properties 
of  the  proposed  drugs,  as  discussed  above.  These  constraints 
are  likely  to  complicate  the  implementation  of  the  IC/ 
anti  effector  strategies. 

The  Problem  of  Drug  Resistance.  With  small  anticancer 
drugs  that  are  in  use  today,  the  macromolecular  target  of  a 
drug  serves  two  distinct  functions.  First,  the  target  is  a 
cell-selectivity  determinant  that  may  bias  the  treatment  against 
tumor  cells.  Second,  the  target  is  also  a  device  whose  inhibition 
by  the  drug  brings  about  the  desired  effect,  e.g.,  cell  death. 
Consequently,  when  drug-resistant  tumor  cells,  selected  by  a 
drug  treatment,  present  themselves  as  a  cancer  recurrence,  the 
necessity  of  employing  another  therapeutic  agent  (if  such  an 
option  exists)  robs  the  physician  of  whatever  cell-selectivity 
advantage  there  was  with  the  earlier  drug. 

The  situation  is  qualitatively  different  with  IC-based  com¬ 
pounds.  Suppose  that  a  treatment  that  included  the  drug  abi 
(Figs.  1  and  2)  results  in  the  appearance  of  abi-resistant  tumor 
cells  that  contain,  for  example,  an  altered  or  overproduced 
enzyme  I.  If  so,  replacement  of  the  i  moiety  by  another  small 
cytotoxic  moiety,  specific  for  another  essential  enzyme,  would 
retain  the  cell  selectivity  of  the  new  ab-containing  drug.  Thus, 


one  advantage  of  modularity  inherent  in  the  designs  of  IC/ 
antiinhibitor-based  compounds  (Figs.  1-4)  lies  in  the  separa¬ 
tion  of  the  effector  aspect  of  a  drug  from  its  selectivity  aspect. 
As  a  result,  once  an  efficacious  arrangement  of  the  selectivity 
modules  in  abi-  or  ci*-type  compounds  has  been  identified,  it 
can  be  reutilized  in  drugs  bearing  effector  moieties  other  than 
i  and  i*. 

Concluding  Remarks.  The  above  considerations  are  based 
on  the  existing  understanding  of  single-target  drugs  and  on  the 
notion  of  steric  hindrance.  By  introducing  the  new  concept  of 
antieffectors  and  a  modification  of  the  previously  proposed 
idea  of  codominant  interference  (14),  we  can  now  attempt  the 
construction  of  small  modular  compounds  that  possess  a 
multitarget,  combinatorial  selectivity  (Fig.  4).  The  IC/ 
antieffector  strategies  are  not  confined  to  cytotoxic  therapies 
and  are  relevant,  in  principle,  to  all  pharmacological  settings. 
As  indicated  above,  one  expected  difficulty  in  implementing 
these  strategies  stems  from  significant  pharmacokinetic  con¬ 
straints  that  do  not  necessarily  arise  with  single-target  drugs. 

This  work  was  motivated  by  the  premise  that  the  confine¬ 
ment  of  anticancer  drug  research  and  development  to  single¬ 
target  compounds  will  prove  insufficient  for  the  task  at  hand, 
because  even  the  informed  choices  of  targets  for  such  drugs 
may  not  define  unambiguously  enough  the  cell  type  to  be 
eliminated.  The  remedy,  described  above,  is  to  aim  for  drugs 
that  possess  qualitatively  different  selectivity — multitarget  and 
combinatorial.  If  this  view  is  correct,  the  future  ascent  of 
multitarget  drugs  may  transform  not  only  the  treatment  of 
cancer  but  also  approaches  in  other  settings  where  the  killing 
or  modification  of  undesirable  cells  or  organelles  is  carried  out 
in  the  presence  of  nearly  identical  cells  or  organelles  that  must 
be  spared.  These  applications  of  multitarget  drugs  encompass 
more  discriminating  antiviral  and  antifungal  therapies,  as  well 
as  the  selective  killing  of  activated  lymphocytes  in  autoimmune 
diseases  and  the  selective  elimination  of  damaged  mitochon¬ 
dria  in  aging  cells  (14, 15).  In  yet  another  class  of  applications, 
a  noncytotoxic  multitarget  drug  would  be  used  to  inhibit  a 
clinically  relevant  nonessential  enzyme  in  a  specific  subset  of 
the  enzyme-containing  cells,  thereby  retaining  the  benefits  of 
inhibition  while  reducing  its  side  effects. 
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ABSTRACT  The  N-end  rule  relates  the  in  vivo  half-life  of  a 
protein  to  the  identity  of  its  N-terminal  residue.  The  N-end  rule 
pathway  is  one  proteolytic  pathway  of  the  ubiquitin  system.  The 
recognition  component  of  this  pathway,  called  N-recognin  or  E3, 
binds  to  a  destabilizing  N-terminal  residue  of  a  substrate  protein 
and  participates  in  the  formation  of  a  substrate-linked  multiu- 
biquitin  chain.  We  report  the  cloning  of  the  mouse  and  human 
Ubrl  cDNAs  and  genes  that  encode  a  mammalian  N-recognin 
called  E3a.  Mouse  UBRlp  (E3a)  is  a  1,757-residue  (200-kDa) 
protein  that  contains  regions  of  sequence  similarity  to  the 
225 -kDa  Ubrlp  of  the  yeast  Saccharomyces  cerevisiae.  Mouse  and 
human  UBRlp  have  apparent  homologs  in  other  eukaryotes  as 
well,  thus  defining  a  distinct  family  of  proteins,  the  UBR  family. 
The  residues  essential  for  substrate  recognition  by  the  yeast 
Ubrlp  are  conserved  in  the  mouse  UBRlp.  The  regions  of 
similarity  among  the  UBR  family  members  include  a  putative 
zinc  finger  and  RING-H2  finger,  another  zinc-binding  domain. 
Ubrl  is  located  in  the  middle  of  mouse  chromosome  2  and  in  the 
syntenic  15ql5-q21.1  region  of  human  chromosome  15.  Mouse 
Ubrl  spans  ^120  kilobases  of  genomic  DNA  and  contains  ^50 
exons.  Ubrl  is  ubiquitously  expressed  in  adults,  with  skeletal 
muscle  and  heart  being  the  sites  of  highest  expression.  In  mouse 
embryos,  the  Ubrl  expression  is  highest  in  the  branchial  arches 
and  in  the  tail  and  limb  buds.  The  cloning  of  Ubrl  makes  possible 
the  construction  of  UftrMacking  mouse  strains,  a  prerequisite 
for  the  functional  understanding  of  the  mammalian  N-end  rule 
pathway. 


A  number  of  regulatory  circuits  involve  metabolically  unstable 
proteins.  Short  in  vivo  half-lives  are  also  characteristic  of  damaged 
or  otherwise  abnormal  proteins  (1-4).  Features  of  proteins  that 
confer  metabolic  instability  are  called  degradation  signals,  or 
degrons.  The  essential  component  of  one  degradation  signal, 
called  the  N-degron,  is  a  destabilizing  N-terminal  residue  of  a 
protein  (5, 6).  The  set  of  amino  acid  residues  that  are  destabilizing 
in  a  given  cell  type  yields  a  rule,  called  the  N-end  rule,  which 
relates  the  in  vivo  half- life  of  a  protein  to  the  identity  of  its 
N-terminal  residue.  Similar,  but  distinct,  versions  of  the  N-end 
rule  pathway  are  present  in  all  organisms  examined,  from  mam¬ 
mals  to  fungi  and  bacteria  (6-8). 

In  eukaryotes,  the  N-degron  comprises  two  determinants:  a 
destabilizing  N-terminal  residue  and  an  internal  lysine  or  lysines 
(8).  The  Lys  residue  is  the  site  of  formation  of  a  multiubiquitin 
chain  (9).  The  N-end  rule  pathway  is  thus  one  pathway  of  the 
ubiquitin  (Ub)  system.  Ub  is  a  76-residue  protein  whose  covalent 
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conjugation  to  other  proteins  plays  a  role  in  a  multitude  of 
processes,  including  cell  growth,  division,  differentiation,  and 
responses  to  stress  (1, 3, 4, 10).  In  most  of  these  processes,  Ub  acts 
through  routes  that  involve  the  degradation  of  Ub-protein  con¬ 
jugates  by  the  26S  proteasome,  an  ATP-dependent  multisubunit 
protease  (11). 

The  N-end  rule  is  organized  hierarchically.  In  the  yeast  Sac¬ 
charomyces  cerevisiae ,  Asn  and  Gin  are  tertiary  destabilizing 
N-terminal  residues  in  that  they  function  through  their  enzymatic 
deamidation  into  the  secondary  destabilizing  N-terminal  residues 
Asp  and  Glu  (12).  The  destabilizing  activity  of  N-terminal  Asp 
and  Glu  requires  their  enzymatic  conjugation  to  Arg,  one  of  the 
primary  destabilizing  residues  (6).  The  primary  destabilizing 
N-terminal  residues  are  bound  directly  by  the  UBR1  -encoded 
N-recognin  (also  called  E3),  the  recognition  component  of  the 
N-end  rule  pathway  (13).  In  S.  cerevisiae ,  N-recognin  is  a  225-kDa 
protein  that  binds  to  potential  N-end  rule  substrates  through  their 
primary  destabilizing  N-terminal  residues — Phe,  Leu,  Trp,  Tyr, 
lie,  Arg,  Lys,  and  His.  N-recognin  has  at  least  two  substrate¬ 
binding  sites.  The  type  1  site  is  specific  for  the  basic  N-terminal 
residues  Arg,  Lys,  and  His.  The  type  2  site  is  specific  for  the  bulky 
hydrophobic  N-terminal  residues  Phe,  Leu,  Tip,  Tyr,  and  lie  (6). 

The  known  functions  of  the  N-end  rule  pathway  include  the 
control  of  peptide  import  in  S.  cerevisiae  (through  degradation  of 
Cup9p,  a  transcriptional  repressor  of  the  peptide  transporter 
Ptr2p);  a  role  in  controlling  the  Sin lp-dependent  phosphoryla¬ 
tion  cascade  that  mediates  osmoregulation  in  S.  cerevisiae ;  the 
degradation  of  Gpalp,  a  G a  protein  of  S.  cerevisiae ;  and  the 
degradation  of  alphaviral  RNA  polymerases  in  virus-infected 
metazoan  cells  (6,  14). 

The  mammalian  counterpart  of  the  yeast  UBR  1 -encoded  N- 
recognin  (E3)  was  characterized  biochemically  in  extracts  from 
rabbit  reticulocytes  (15-17).  Rabbit  E3a  was  shown  to  be  spe- 


Abbreviations:  Ub,  ubiquitin;  kb,  kilobasc;  id.,  identity;  si.,  similarity; 
BAG,  bacterial  artificial  chromosome;  FISH,  fluorescence  in  situ 
hybridization;  en,  embryonic  day. 

Data  deposition:  Nucleotide  sequences  reported  in  this  work  have 
been  deposited  in  the  GenBank  database  [accession  nos.  AF061555 
(mouse  Ubrl  cDNA)  and  AF061556  (human  UBR1  cDNA)]. 

**To  whom  reprint  requests  should  be  addressed  at:  Division  of 
Biology,  147-75,  Caltech,  1200  East  California  Boulevard,  Pasadena, 
CA  91125.  e-mail:  avarsh@cco.caltech.edu. 
ttThe  names  of  mouse  genes  arc  in  italics,  with  the  first  letter 
uppercase.  The  names  of  human  and  S.  cerevisiae  genes  are  also  in 
italics,  all  uppercase.  If  human  and  mouse  genes  are  named  in  the 
same  sentence,  the  mouse  gene  notation  is  used.  The  names  of  S. 
cerevisiae  proteins  are  Roman,  with  the  first  letter  uppercase  and  an 
extra  lowercase  “p”  at  the  end.  The  names  of  the  corresponding 
mouse  and  human  proteins  are  the  same,  except  that  all  letters  but 
the  last  “p”  arc  uppercase.  The  latter  usage  is  a  modification  of  the 
existing  convention  (33),  to  facilitate  simultaneous  discussions  of 
yeast,  mouse,  and  human  proteins.  In  some  citations,  the  abbrevi¬ 
ated  name  of  a  species  precedes  the  gene’s  name. 
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Fig.  1.  Peptides  of  rabbit  UBRlp  (E3a)  and  isola¬ 
tion  of  the  mouse  Ubrt  cDNA.  (A)  Amino  acid  se¬ 
quences  of  tryptic  peptides  of  the  purified  rabbit 
UBRlp  (see  Materials  and  Methods).  The  alternative 
sets  of  peptide  names,  T-based  and  PEP1-PEP3  (in 
parentheses),  refer  to  two  different  preparations  of 
E3a.  The  sequences  of  T120,  T76,  T96,  and  T122  that 
were  encoded  by  DNA  sequences  identified  through 
intrapeptide  PCR  are  underlined.  Residues  deduced 
from  the  mouse  Ubrl  cDNA  that  differed  from  those 
inferred  through  peptide  sequencing  arc  indicated  in  a 
smaller  font.  The  peptides’  positions  in  the  deduced 
sequence  of  mouse  UBRlp  arc  indicated.  (B)  The 
intrapeptide/interpeptide-PCR  cloning  strategy.  The 
products  of  the  initial  intrapeptide  PCR,  derived  from 
rabbit  genomic  DNA,  were  used  to  carry  out  interpep- 
tidc  PCRwith  a  rabbit  liver  cDNA  library  (CLON- 
TECH).  The  resulting  392-bp  fragment  of 
the  rabbit  Ubrl  cDNA  was  used  to  isolate,  using  PCR  and  a  Agtll  mouse  liver  cDNA  library,  the  corresponding  392-bp  mouse  Ubrl  cDNA  fragment. 
This  fragment  then  was  used  to  screen  the  same  cDNA  library,  yielding  a  2.4-kb  fragment  of  the  mouse  Ubrl  cDNA  that  encoded  several  of  the 
peptide -derived  sequences  of  the  rabbit  UBRlp.  The  encoded  sequence  was  also  significantly  similar  to  that  of  the  N-terminal  region  of  S.  cerevisiae  Ubrlp 
(13)  and  contained  the  putative  start  (ATG)  codon  of  the  mouse  Ubrl  ORF.  To  isolate  the  rest  of  the  5'  region  of  the  Ubrl  cDNA,  5'-rapid  amplification 
of  cDNA  ends  (RACE)-PCR  (20)  was  performed  with  poly(A)+  RNA  from  mouse  L  cells  and  a  primer  from  the  2.4-kb  DNA  fragment.  3'-RACE-PCR 
(20)  was  used  to  amplify  a  downstream  region  of  Ubrl  cDNA.  The  resulting  DNA  fragment  (nucleotides  2,470-3,467)  then  was  used  to  screen  a  AgtlO 
mouse  cDNA  library  from  MEL-C19  cells.  Five  overlapping  cDNA  isolates  (MR16,  MR17,  MR19,  MR20,  and  MR23)  that  together  spanned  the  entire 
Ubrl  cDNA  were  mapped  and  subcloned  into  Bluescript  II  SK+  (Stratagene),  yielding  the  plasmid  MR26,  which  contained  the  entire  ORF  of  Ubrl.  The 
ORF  region  of  Ubrl  cDNA  was  sequenced  on  both  strands  at  least  twice,  using  independently  derived  cDNA  clones. 
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cifically  required  for  the  Ub-dependent  degradation  of  proteins 
bearing  either  type  1  (basic)  or  type  2  (bulky  hydrophobic) 
destabilizing  N-terminal  residues  (7,  15,  16). 

We  began  dissection  of  the  mouse  N-end  rule  pathway  by 
isolating  the  Ntanl  gene,  which  encodes  the  asparagine-specific 
N-terminal  amidase  (18,  19),  a  component  of  the  mammalian 
N-end  rule  pathway,  and  by  constructing  mouse  strains  that  lack 
Ntanl  (Y.T.K.  and  A.V.,  unpublished  data).  Herein,  we  describe 
the  cloning  and  characterization  of  the  mouse  and  human  cDNAs 
and  gcncstt  that  encode  UBRlp  (E3a),  a  homolog  of  yeast 
Ubrlp  and  the  main  recognition  component  of  the  N-end  rule 
pathway. 

MATERIALS  AND  METHODS 

Isolation  and  Partial  Sequencing  of  Mammalian  E3a 
(UBRlp).  Rabbit  E3a  was  purified  from  reticulocyte  extracts  by 
using  affinity  chromatography  with  immobilized  protein  sub¬ 
strates  of  UBRlp  and  elution  with  dipeptides  bearing  destabi¬ 
lizing  N-terminal  residues  (16).  The  resulting  preparation  was 


fractionated  by  SDS/PAGE.  The  band  of  ~180-kDa  E3a  was 
excised  and  subjected  to  digestion  with  trypsin.  Amino  acid 
sequences  were  determined  for  14  peptides  of  rabbit  UBRlp  (Fig. 
\A)  by  using  standard  methods  (20). 

Isolation  of  the  Full-Length  Mouse  Ubrl  cDNA.  A  strategy  that 
included  the  intrapeptide-interpeptide  PCR  (21)  was  used  (see 
the  legend  to  Fig.  1). 

Isolation  of  a  Partial  Human  UBR1  cDNA.  Poly(A)+  RNA 
from  human  293  cells  was  subjected  to  reverse  transcription- 
PCR,  using  sets  of  primers  corresponding  to  sequences  of  the 
mouse  Ubrl  cDNA.  One  of  the  reactions  yielded  a  1.0-kilobase 
(kb)  fragment  that  encompassed  a  region  of  the  human  UBR1 
cDNA  (Fig.  2). 

Mouse  and  Human  Genomic  Ubrl  Fragments.  A  library  of 
mouse  genomic  DNA  fragments  (strain  SvJ)  in  bacterial  artificial 
chromosome  (BAC)  (22)  vector  (Genome  Systems,  St.  Louis) 
was  used,  as  described  in  the  legend  to  Fig.  2. 

Northern,  Southern,  and  Whole-Mount  in  Situ  Hybrid¬ 
izations.  Mouse  and  human  multiple-tissue  Northern  blots 
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Fig.  2.  The  mouse  and  human  Ubrl 
cDNAs  and  genes.  Thick  horizontal  lines 
represent  genomic  DNA.  The  upper  one  is 
a  ^31 -kb  fragment  of  the  mouse  Ubrl  gene 
that  corresponds  to  a  1.34-kb  region  of 
mouse  Ubrl  cDNA  (nucleotides  3 15— 
1,454).  Vertical  rectangles  represent  exons. 
Their  lengths,  and  the  lengths  of  the  in- 
trons,  are  indicated,  respectively,  below  and 
above  the  horizontal  line.  In  a  composite 
diagram  of  the  Ubrl  cDNA,  the  exons  arc 
depicted  as  alternatively  shaded  rectangles. 
For  exon  1,  only  its  translated  region  is 
indicated.  Shown  belowthe  cDNA 


diagram  is  a  ~21-kb  fragment  of  the  human  UBRl  gene,  corresponding  to  1.0  kb  of  the  indicated  region  of  the  human  UBR1  cDNA  (nucleotides 
2,218-3,227  of  the  mouse  Ubrl  cDNA  sequence).  The  mouse  and  human  Ubrl  exons  are  denoted,  respectively,  by  numbers  and  letters.  Also  indicated 
are  the  exon  locations  of  some  of  the  type  1  and  type  2  substrate-binding  sites  of  N-recognin  (the  essential  amino  acid  residues  are  underlined)  (A.  Webster, 
M.  Ghislain,  and  A.V.,  unpublished  data;  see  the  main  text).  Not  shown  are  the  114-bp  5 '-untranslated  region  (UTR)  and  the  1,010  bp  3'-UTR  of  the 
mouse  Ubrl  cDNA.  To  isolate  mouse  Ubrl ,  a  library  of  mouse  genomic  DNA  fragments  in  a  BAC  vector  (see  Materials  and  Methods)  was  screened  with 
a  fragment  of  the  mouse  Ubrl  cDNA  (nucleotides  105-1,333)  as  a  probe,  yielding  seven  BAC  clones,  of  which  BAC3  and  BAC4  contained  the  entire 
Ubrl  gene.  The  exon/intron  organization  of  the  first  31  kb  (~1  /4)  of  the  mouse  Ubrl  gene  was  determined  by  using  exon-specific  PCR  primers  to  produce 
~40  genomic  DNA  fragments  of  the  BAC3  insert  that  ranged  in  size  from  1.3  to  18  kb.  Regions  encompassing  the  exon/intron  junctions  then  were 
sequenced  by  using  intron-spccific  primers.  Fragments  of  the  human  genomic  UBR1  DNA  were  isolated  by  using  primers  derived  from  the  1 .0-kb  fragment 
of  the  human  UBR1  cDNA,  the  Expand  High  Fidelity  PCR  System  (Roche  Molecular  Biochemicals,  Indianapolis,  IN),  and  genomic  DNA  from  human 
293  cells.  The  resulting  four  fragments  were  subcloned  into  pCR2.1  (Invitrogen),  yielding  the  plasmids  HRS,  HR6-4,  HR2-25,  and  HR7-2,  whose  partially 
overlapping  inserts  encompassed  ~21  kb  of  the  human  UBR1  gene.  Partial  sequencing  of  the  mouse  and  human  genomic  Ubrl  fragments  (~20  kb  of 
sequenced  DNA)  included  all  of  the  exon/intron  junctions  in  these  regions  of  Ubrl. 
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Fig.  3.  Comparison  of  the  deduced  amino  acid  sequence  of  mouse  UBRlp  (Mm-UBRl)  with  those  of  C.  elegans  UBRlp  (Ce-UBRl),  S. 
cerevisiae  Ubrlp  (Sc-UBRl),  and  K  lactis  Ubrlp  (Kl-UBRl).  White-on-black  and  gray  shadings  highlight,  respectively,  identical  and  similar  residues. 
The  residues  of  UBR  proteins  that  are  identical  to  those  of  S.  cerevisiae  Ubrlp  are  denoted  by  double  dots,  at  positions  where  the  identity  involves 
just  one  non  -cerevisiae  protein.  Also  indicated  are  the  regions  of  significant  similarity  among  the  four  proteins.  K.  lactis  UBR1  was  cloned  through 
its  crosshybridization  to  S .  cerevisiae  UBR1  (P.  Waller  and  A.V.,  unpublished  data). 


(CLONTECH),  and  either  mouse  or  human  Ubrl  cDNA 
fragments  labeled  with  32P  were  used  (20).  Southern  hybrid¬ 
izations  were  carried  out  by  using  standard  techniques  (20). 
Mouse  embryos  were  staged,  fixed,  and  processed  for  in  situ 
hybridization  as  described  (23).  For  sectioning,  the  stained 
embryos  were  embedded  in  OCT  medium  (Sakura  Finetek, 
Torrance,  CA).  A  1.2-kb  Ubrl  cDNA  fragment  (nucleotides 


3,150-3,355)  was  used  as  a  template  for  synthesizing  anti- 
sense-  or  sense-strand  RNA  probes  labeled  with  digoxigenin 
(23). 

Chromosome  Mapping  of  the  Mouse  and  Human  Ubrl.  The 

mapping  of  mouse  Ubrl  was  carried  out  by  using  the  interspecific 
backcross  analysis  (24),  essentially  as  described  (18).  Human 
UBR1  was  mapped  by  using  fluorescence  in  situ  hybridization 
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Fig.  4.  Two  Cys/His  domains  of  the 
UBR  protein  family.  Comparison  of  the 
putative  zinc  finger  (region  I)  and 
RING-H2  finger  (region  IV)  with  the 
corresponding  sequences  from  the  other 
species  in  Fig.  3,  and  also  with  C.  albicans 
Ubrlp  (Ca-UBRl),  C.  elegans  UBR2p 
(Ce-UBR2),  and  S.  cerevisiae  Ubr2p  (Sc- 
UBR2).  Numbers  indicate  the  lengths  of 
gaps.  The  conserved  Cys  and  His  residues 
are  indicated. 


(FLSH)  with  mitotic  chromosomes  from  human  lymphocytes 
(25).  The  probe  was  a  mixture  of  the  HR8,  HR6-4,  HR2-25,  and 
HR7-2  plasmids,  labeled  with  biotin  using  biotinylated  dATP  and 
the  BioNick  labeling  kit  (Life  Technologies,  Grand  Island,  NY), 
and  detected  by  using  fluorescein  isothiocyanate-avidin. 

RESULTS  AND  DISCUSSION 

Isolation  of  the  Mouse  Ubrl  cDNA.  Tryptic  peptides  of  the 
purified  rabbit  E3a  (UBRlp)  (16)  were  isolated  and  sequenced 
(see  Materials  and  Methods ),  yielding  1 4  short  regions  of  E3a  (Fig. 
\A).  These  regions  lacked  significant  similarities  to  the  deduced 
sequence  of  S .  cerevisiae  Ubrlp  (13).  We  used  intrapeptide  PCR 
(21)  to  identify  a  unique  (nondegenerate)  sequence  of  the  rabbit 
Ubrl  cDNA.  This  method  allows  amplification  of  a  short  unique 
DNA  sequence  by  using  two  degenerate  PCR  primers  (derived  by 
reverse  translation)  that  flank  this  sequence  and  correspond  to 
the  outermost  regions  of  a  single  peptide  (Fig.  1#).  Several 
intrapeptide  nucleotide  sequences  were  obtained  this  way  (Fig. 
1).  These  sequences,  together  with  those  of  the  original  degen¬ 
erate  primers,  then  were  used  to  amplify  a  392-bp  fragment  of  the 
rabbit  Ubrl  cDNA,  using  interpeptide  PCR  (Fig.  1).  This  frag¬ 
ment  encoded  peptides  T120  and  T134  at  either  end  and  peptide 
T100  in  the  middle  (Fig.  IB).  A  homologous  392-bp  fragment  of 
the  mouse  Ubrl  cDNA  then  was  amplified  by  using  the  same 
method  (Fig.  IB).  The  rabbit  and  mouse  392-bp  Ubrl  cDNA 
fragments  were  88%  and  89%  identical  at  the  nucleotide  and 
amino  acid  sequence  levels,  respectively,  but  lacked  significant 
similarities  to  S.  cerevisiae  UBR1  (data  not  shown). 

The  392-bp  mouse  Ubrl  cDNA  fragment  then  was  used,  in 
conjunction  with  standard  cDNA  library  screening  and  rapid 
amplification  of  cDNA  ends-PCR  (20),  to  isolate  multiple  Ubrl 
cDNA  fragments,  and  to  assemble  them  into  a  5,271-bp  ORF 
encoding  a  1,757-residue  protein  (pi  of  6.0),  whose  size,  200  kDa, 
was  close  to  the  estimated  size  of  the  isolated  rabbit  UBRlp 
(E3a),  ~180  kDa  (16)  (Figs.  2  and  3).  The  inferred  ATG  start 
codon  (Fig.  2),  within  the  sequence  CTTAAGATGGCG,  is 
preceded  by  two  in-framc  stop  codons,  at  positions  —48  and  —93, 
and  is  located  in  a  favorable  Kozak  context  (26),  with  A  and  G 
at  positions  -3  and  +4,  respectively.  There  are  two  more  ATGs, 
five  and  11  codons  downstream  of  the  inferred  one.  These 
alternative  start  codons  are  in  a  favorable  Kozak  context  as  well. 

Cloning  and  Partial  Characterization  of  the  Mouse  and  Hu¬ 
man  Ubrl  Genes.  A  fragment  of  the  mouse  Ubrl  cDNA  was  used 
to  isolate  a  ^120-kb  mouse  Ubrl  genomic  DNA  clone,  carried  in 
a  BAC  vector  (22).  We  determined  the  exon/intron  organization 
and  restriction  map  of  the  ~31-kb  region  of  Ubrl  that  corre¬ 
sponded  to  the  1,340-bp  5'-region  of  the  mouse  Ubrl  cDNA 
(nucleotides  105-1,333)  (Fig.  2).  The  lengths  of  the  12  exons  in 
this  region  of  mouse  Ubrl  range  from  63  to  257  bp  (Fig.  2). 

The  nucleotide  and  deduced  amino  acid  sequences  of  the 
1.0-kb  human  UBR1  cDNA  fragment  (see  Materials  and  Meth¬ 
ods ),  located  approximately  in  the  middle  of  UBR1  cDNA 
(nucleotides  2,218-3,227  of  the  mouse  Ubrl  cDNA)  (Fig.  2), 
were,  respectively,  91%  and  94%  identical  to  the  corresponding 
mouse  Ubrl  cDNA  and  UBR  Ip  sequences.  Overlapping  genomic 


DNA  fragments  of  human  UBR1  that,  together,  encompassed  a 
^21 -kb  region  of  the  human  UBR1  gene  and  corresponded  to  the 
1.0-kb  fragment  of  the  human  UBR1  cDNA  (Fig.  2),  were  isolated 
from  human  DNA  by  using  cDNA-derived  primers  and  PCR. 
Partial  sequencing  showed  that  this  ^21-kb  region  of  human 
UBR1  contained  11  exons  whose  length  ranged  from  49  to  155  bp, 
a  distribution  of  exon  lengths  similar  to  that  in  a  different  region 
of  mouse  Ubrl  (Fig.  2).  All  of  the  sequenced  exon/intron 
junctions  (^23  exons),  which  encompassed  a  ^52-kb  region  of 
the  mouse  and  human  Ubrl ,  contained  the  consensus  GT  and  AG 
dinucleotides  characteristic  of  the  mammalian  nuclear  pre- 
mRNA  splice  sites  (data  not  shown)  (20).  Extrapolating  from 
these  data  on  the  mouse  and  human  Ubrl  genes  and  the  corre¬ 
sponding  regions  of  their  cDNAs  (Fig.  2),  a  mammalian  Ubrl 
gene  is  expected  to  be  ^120  kb  long  and  to  contain  ~50  exons. 

The  Mouse  UBRlp  Protein  and  its  Homologs.  The  low  overall 
sequence  similarity  of  mouse  UBRlp  (E3a)  to  Ubrlp  of  either  S. 
cerevisiae  [22%  identity  (id.),  48%  similarity  (si.)]  or  another 
budding  yeast,  Kluyveromyces  lactis  (21%  id.,  48%  si.),  belied  the 
presence  of  five  regions,  denoted  I-V,  which  were  significantly 
similar  between  the  mouse  and  yeast  versions  of  UBRlp  (Figs.  3 
and  4).  By  contrast,  the  Ma/J-encoded  asparagine-specific  N- 
terminal  amidase,  the  most  upstream  component  of  the  mouse 
N-end  rule  pathway,  lacks  sequence  similarities  to  its  S .  cerevisiae 
counterpart  Ntalp  (12,  18).  Database  searches  identified  other 
likely  homologs  of  mouse  UBRlp,  in  particular  the  1,927-residue 
protein  of  the  nematode  Caenorhabditis  elegans  (GenBank  ac¬ 
cession  no.  U88308)  (32%  id.,  53%  si.;  termed  Ce-Ubrl)\  the 
1,872-residue  S.  cerevisiae  protein  (GenBank  accession  no. 
Z73196)  (21%  id.,  47%  si.;  termed  Sc-UBR2\  ref.  4);  the  2,168- 
residue  C.  elegans  protein  (GenBank  accession  no.  U40029)  (21% 
id.,  45%  si.;  termed  Ce-Ubr2 );  and  the  794-residue  CER3p 
protein  of  the  plant  Arabidopsis  thaliana  (GenBank  accession  no. 
X95962)  (26%  id.,  49%  si.).  CER3p  is  involved  in  wax  biosyn¬ 
thesis  in  ,4  thaliana  (27).  In  addition,  a  147-residue  sequence  of 
the  yeast  Candida  albicans  (http://alces.med.umn.edu/bin/ 
genelist?LUBRl)  was  similar  to  the  N-terminal  region  of  mouse 
UBRlp  (Fig.  4). 

The  presence  of  high-similarity  regions  I-V  among  these 
deduced  sequences  (Figs.  3  and  4)  suggested  the  existence  of  a 
distinct  protein  family,  termed  UBR.  The  66-residue  region  I, 
near  the  N  terminus  of  UBRlp,  is  a  particularly  clear  UBR 
family-identifying  region  (e.g.,  61%  id.,  75%  si.  between  mouse 
and  C.  elegans  UBRlp)  (Figs.  3  and  4). 

Recent  genetic  analyses  of  S.  cerevisiae  Ubrlp  (N-recognin) 
have  shown  that  the  regions  I— III  contain  residues  essential  for 
the  recognition  of  N-end  rule  substrates  by  Ubrlp.  In  particular, 
Cys-145,  Val-146,  Gly-173,  and  Asp-176  of  region  I  were  identi¬ 
fied  as  essential  residues  of  the  type  1  binding  site  of  S.  cerevisiae 
Ubrlp  (A.  Webster,  M.  Ghislain,  and  A.V.,  unpublished  data). 
All  four  of  these  residues  were  conserved  between  the  yeast, 
mouse,  and  C.  elegans  UBRlp  (Figs.  3  and  4).  Region  I  is  present 
in  all  of  the  known  UBR  family  members  except  CER3p  of  A. 
thaliana ,  which  contains  only  regions  IV  and  V  (Fig.  4).  Region 
I  encompasses  a  Cys/His-rich  domain,  Cys-Xi2-Cys-X2-Cys-X5- 


Fig.  5.  Northern  and  in  situ  hybridizations  with  mouse  and  human  Ubrl .  (A)  Membranes  containing  electrophoretically  fractionated  poly(A)+ 
mRNA  from  different  mouse  (a-c)  or  human  ( d  and  e)  tissues  were  hybridized  with  either  a  2-kb  5'-proximal  (nucleotides  116-2,124)  mouse  Ubrl 
cDNA  fragment  (a),  its  0.64-kb  3'-proximal  (nucleotides  4,749-5,388)  fragment  (b),  a  1-kb  human  UBR1  cDNA  fragment  (d),  or  the  human  j3-actin 
cDNA  fragment  (c  and  e).  The  upper  arrows  in  a  and  d  indicate  the  *«8-kb  Ubrl  transcript.  The  lower  arrow  in  a  indicates  the  ^6-kb  testis-specific 
Ubrl  transcript.  In  the  RNA  sample  from  mouse  spleen,  the  Ubrl  transcript  (but  not  the  actin  transcript)  may  have  been  degraded  (a-c).  ( B ) 
Expression  of  Ubrl  in  el0.5  and  el  1.5  mouse  embryos.  Whole-mount  in  situ  hybridization  was  carried  out  with  either  antisense  (AS)  or  sense  (S, 
negative  control)  Ubrl  cDNA  probes  (see  Materials  and  Methods).  The  regions  of  high  Ubrl  expression  are  indicated  by  arrows  (t,  tail;  fl,  forelimb 
buds;  hi,  hindlimb  buds).  The  branchial  arches,  where  Ubrl  is  also  highly  expressed  in  el0.5  embryos  (data  not  shown),  are  not  visible  in  this  el0.5 
embryo.  (C)  Expression  of  Ubrl  in  the  surface  ectoderm  of  limb  buds.  Shown  is  a  transverse  section  of  a  forelimb  bud  of  an  el0.5  embryo  (se, 
surface  ectoderm).  (D)  FISH  analysis  of  human  UBRL  (Upper)  An  example  of  the  UBRl- specific  FISH  signal  (arrow).  (Lower)  The  same  mitotic 
spread  stained  with  4'-6-diamino-2-phenylindole  (DAPI)  to  visualize  the  chromosomes  (see  also  Fig.  6). 


Cys-X2-Cys-X2-Cys-X5-His-X2-His-X(i2-i4)-Cys-XrCys-XirCys 
(Figs.  3  and  4),  which  is  distinct  from  the  known  consensus 
sequences  of  zinc  fingers  and  other  Cys/His-motifs.  Residues 
Asp-318,  His-321,  and  Glu-560  of  S.  cerevisiae  Ubrlp,  which  have 
been  identified  as  essential  for  the  type  2  binding  site  of  this 
N-recognin  (A.  Webster,  M.  Ghislain,  and  A.V.,  unpublished 
data),  were  found  to  be  retained  in  region  II  (Asp-318  and 
His-321)  and  region  III  (Glu-560)  of  the  mouse  and  C.  elegans 
UBRlp  (Fig.  3). 

Region  IV  contains  another  Cys/His-rich  domain  of  UBRlp, 
Cys-X2-Cys-loop  i-Cys-Xi-His-X2-His-X2-Ctys-/<9C>/?  2-Cys-X2-Cys 
(Figs.  3  and  4),  which  is  present  in  all  of  the  UBR  family  members, 
and  fits  the  consensus  sequence  of  the  RING-H2  finger,  a 
subfamily  of  the  previously  defined  RING  motif  (28).  At  least 
some  of  the  RING-H2  sequences  are  sites  of  specific  protein- 
protein  interactions  (28).  Apcllp,  a  subunit  of  the  Ub-protein 
ligase  complex  called  the  cyclosome  (2)  or  the  anaphase  promot¬ 
ing  complex,  also  contains  a  RING-H2  finger  (29). 

Another  area  of  similarity  (24-50%  id.,  46-70%  si.)  among  the 
UBR  family  members  is  region  V  (Fig.  3  and  data  not  shown). 
This  region,  115  residues  long  in  mouse  UBRlp,  near  the 
protein’s  C  terminus,  is  particularly  similar  between  mouse  and  C. 
elegans  UBRlp  (50%  id.,  70%  si.)  (Fig.  3).  Region  V  is  located 
4-14  residues  from  the  UBR  proteins’  C  termini,  the  exceptions 
being  the  S.  cerevisiae  and  K  lactis  Ubrlp,  which  bear,  respec¬ 
tively,  132-  and  159-residue  tails  of  unknown  function  that  are  rich 
in  the  acidic  Asp/Glu  residues  (36%  and  33%)  (Fig.  3).  No 
significant  similarities  could  be  detected  between  mammalian 
UBRlp  and  other  E3s  (recognins)  of  the  metazoan  Ub  system, 


including  E6AP  (30)  and  subunits  of  the  cyclosome/anaphase 
promoting  complex,  except  for  the  presence  of  a  RING-H2  finger 
domain  in  the  latter  (29).  [Different  E3  proteins  of  the  Ub  system 
recognize  different  degrons  in  protein  substrates,  thereby  defin¬ 
ing  distinct  Ub-dependent  proteolytic  pathways  (1,  4).] 

Expression  of  Mouse  and  Human  Ubrl.  The  5'-  and  3'- 
proximal  mouse  cDNA  probes  yielded  similar  results,  detecting  a 
single  ^8-kb  transcript  in  several  tissues  (Fig.  5  Aa  and Ab).  In  the 
testis,  however,  the  ^8-kb  species  of  Ubrl  mRNA  was  a  minor 
one,  the  major  species  being  ^6  kb  (Fig.  5Aa).  The  levels  of  either 
mouse  or  human  Ubrl  mRNA  were  highest  in  skeletal  muscle  and 
heart  (Fig.  5/4).  The  expression  of  mRNA  encoding  E2i4K,  one  of 
the  mouse  Ub-conjugating  (E2)  enzymes  and  a  likely  component 
of  the  mouse  N-end  rule  pathway  (6),  was  also  highest  in  skeletal  f 

muscle  and  heart  (18). 

The  distinct  Ubrl  mRNA  pattern  in  the  testis  (Fig.  5A)  was 
reminiscent  of  the  analogous  expression  pattern  of  Ntanl 
mRNA,  which  encodes  the  Asn-specific  N-terminal  amidase, 
another  component  of  the  mammalian  N-end  rule  pathway. 
Specifically,  the  size  of  the  major  species  of  Ntanl  mRNA  was 
«*1.4  kb  in  all  of  the  examined  mouse  tissues  except  testis, 
where  the  major  species  was  «*1.1  kb  (18).  The  ^1.1-kb  Ntanl 
transcript  recently  was  found  to  hybridize  only  to  the  3' -half 
(exons  6-10  but  not  exons  1-5)  of  the  Ntanl  ORF  (Y.T.K.  and 
A.V.,  unpublished  data).  The  functional  significance  of  the 
testis-specific  Ubrl  and  Ntanl  expression  patterns  remains  to 
be  understood. 

We  used  whole-mount  in  situ  hybridization  to  examine  the 
expression  of  Ubrl  during  embryogenesis.  In  e9.5  (9.5  days  old) 
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Fig.  6.  Chromosomal  locations  of  the  mouse  and  human  Ubrl 
genes.  (A)  Mouse  Ubrl  was  mapped  to  the  middle  of  mouse  chromo¬ 
some  2  by  using  interspecific  (M.  musculus-M.  spretus)  backcross 
analysis  (18,  24).  Shown  are  the  segregation  patterns  of  mouse  Ubrl 
and  the  flanking  genes  in  66  backcross  animals  that  were  typed  for  all 
loci.  For  individual  pairs  of  loci,  more  than  66  animals  were  typed. 
Bach  column  represents  the  chromosome  identified  in  the  backcross 
progeny  that  was  inherited  from  the  [M.  musculus  C57BL/6J  X  M. 
spretus]  Fi  parent.  Filled  and  empty  squares  represent,  respectively, 
C57BL/6J  and  M.  spretus  alleles.  The  numbers  of  offspring  that 
inherited  each  type  of  chromosome  2  arc  listed  below  the  columns.  (5) 
A  partial  mouse  chromosome  2  linkage  map  (MMU2),  showing  Ubrl 
in  relation  to  the  linked  genes  Thbsl ,  Ebp4.2 ,  and  B2m ,  and  also,  on 
the  left,  the  corresponding  recombination  distances  between  the  loci, 
in  centimorgans,  and  the  map  locations,  in  parentheses.  (C)  A  partial 
human  chromosome  15  linkage  map  (HSA15).  Bach  dot  on  the  right, 
in  the  15ql5-q21.1  region,  corresponds  to  the  actually  observed 
UBR 1  -specific  double-dot  FISH  signal  detected  on  human  chromo¬ 
some  15  (see  also  Fig.  5D). 

mouse  embryos,  the  expression  of  Ubrl  was  highest  in  the 
branchial  arches  and  in  the  buds  of  forelimbs  and  the  tail  (data 
not  shown).  In  el 0.5  embryos,  the  expression  of  Ubrl  became 
high  in  the  hindlimb  buds  as  well  (Fig.  5B).  This  pattern  was 
maintained  in  the  limb  buds  of  el  1.5  embryos  (Fig.  5B ).  High 
expression  of  Ubrl  in  the  limb  buds  was  confined  predominantly 
to  the  surface  ectoderm  (Fig.  5C).  This  pattern  of  Ubrl  expression 
in  embryos  (Fig.  5  B  and  C)  is  similar,  if  not  identical,  to  that  of 
Ntanl ,  which  encodes  asparagine-specific  N-terminal  amidase 
(18)  (Y.T.K.  and  A.V.,  unpublished  data),  consistent  with 
UBRlp  and  NTANlp  being  components  of  the  same  pathway. 

The  enhanced  expression  of  Ubrl  in  the  embryonic  limb  buds 
(Fig.  5  B  and  C)  is  interesting  in  view  of  the  conjecture  that  the 
N-end  rule  pathway  might  be  required  for  limb  regeneration  in 
amphibians  (31).  The  injection  of  dipeptides  bearing  destabilizing 
N-terminal  residues  into  the  stumps  of  amputated  forelimbs  of 
the  newt  was  observed  to  delay  limb  regeneration,  whereas  the 
injection  of  dipeptides  bearing  stabilizing  N-terminal  residues  had 
no  effect  (31).  Rigorous  tests  of  this  and  other  suggested  functions 
of  the  metazoan  N-end  rule  pathway  (6)  will  require  mouse 
strains  that  lack  Ubrl. 

Chromosome  Mapping  of  Mouse  and  Human  Ubrl .  The 

chromosomal  location  of  mouse  Ubrl  was  determined  by  inter¬ 
specific  backcross  analysis,  using  DNA  derived  from  matings  of 
[(C57BL/6J  X  Mus  spretus) Fi  X  C57BL/6J]  mice  (Fig.  6  A  and 
B)  (18,  24).  Mouse  Ubrl  is  located  in  the  central  region  of 
chromosome  2  and  is  linked  to  the  Thbsl,  Epb4.2,  and  B2m  genes, 
the  most  likely  gene  order  being  centromer e-Thbsl-Ubrl-Epb4.2- 
B2m  (Fig.  6 B  and  data  not  shown). 

The  chromosomal  location  of  human  UBRI  was  determined 
by  using  FISH  (25),  with  human  UBRI  genomic  DNA  frag¬ 
ments  as  probes  (Figs.  5 D  and  6C).  This  mapping  placed  UBRI 
at  the  15ql5-15q21.1  region  of  the  human  chromosome  15,  an 
area  syntenic  with  the  independently  mapped  position  of 
mouse  Ubrl  (Fig.  6).  Ubrl  is  located  in  the  regions  of  human 
chromosome  15  and  mouse  chromosome  2  that  appear  to  be 
devoid  of  the  previously  mapped  but  uncloned  mutations. 


Mutations  in  the  human  gene  CANP3 ,  which  encodes  a  subunit 
of  calpain  and  is  located  very  close,  if  not  adjacent,  to  UBRI , 
have  been  shown  to  cause  a  myopathy  called  the  limb-girdle 
muscular  distrophy  (32). 

Concluding  Remarks.  Isolation  of  the  mouse  and  human  Ubrl 
cDNAs  and  genes  (Figs.  2-6)  should  enable  functional  under¬ 
standing  of  the  mammalian  N-end  rule  pathway,  in  part  through 
the  construction  and  analysis  of  mouse  strains  that  lack  Ubrl. 
Recent  searches  in  GenBank  identified  several  mouse  and  human 
sequences  in  expressed  sequence  tag  databases  that  exhibited 
significant  similarity  to  the  C-terminal  region  of  mouse  UBRlp. 
The  cloning  and  characterization  of  the  corresponding  cDNAs 
have  shown  that  there  exist  at  least  two  distinct  mouse  (and 
human)  genes,  termed  Ubr2  and  Ubr3,  which  encode  proteins  that 
are  significantly  similar  to  mouse  UBRlp  (Y.T.K.  and  A.V., 
unpublished  data).  Molecular  and  functional  analyses  of  these 
Ubrl  homologs  are  under  way. 
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Ubiquitin-dependent  proteolytic  systems  underlie 
many  processes,  including  the  cell  cycle,  cell  differenti¬ 
ation  and  responses  to  stress.  One  such  system  is  the 
N-end  rule  pathway,  which  targets  proteins  bearing 
destabilizing  N-terminal  residues.  Here  we  report  that 
Ubrlp,  the  main  recognition  component  of  this  path¬ 
way,  regulates  peptide  import  in  the  yeast  Saccharo- 
myces  cerevisiae  through  degradation  of  Cup9p,  a  35 
kDa  homeodomain  protein.  Cup9p  was  identified  using 
a  screen  for  mutants  that  bypass  the  previously 
observed  requirement  for  Ubrlp  in  peptide  import. 
We  show  that  Cup9p  is  a  short-lived  protein  ( tll2 
-5  min)  whose  degradation  requires  Ubrlp.  Cup9p 
acts  as  a  repressor  of  PTR2 ,  a  gene  encoding  the 
transmembrane  peptide  transporter.  In  contrast  to 
engineered  N-end  rule  substrates,  which  are  recognized 
by  Ubrlp  through  their  destabilizing  N-terminal 
residues,  Cup9p  is  targeted  by  Ubrlp  through  an 
internal  degradation  signal.  The  Ubrlp-Cup9p-Ptr2p 
circuit  is  the  first  example  of  a  physiological  process 
controlled  by  the  N-end  rule  pathway.  An  earlier  study 
identified  Cup9p  as  a  protein  required  for  an  aspect 
of  resistance  to  copper  toxicity  in  S. cerevisiae.  Thus, 
one  physiological  substrate  of  the  N-end  rule  pathway 
functions  as  both  a  repressor  of  peptide  import  and  a 
regulator  of  copper  homeostasis. 

Keywords :  CUP9/N -end  rule/peptide  import/proteolysis/ 
PTR2IUBR1 


Introduction 

Many  regulatory  proteins  are  short-lived  in  vivo  (Schwob 
et  al ,  1994;  King  et  al ,  1996;  Varshavsky,  1996).  This 
metabolic  instability  makes  possible  rapid  adjustment 
of  the  protein’s  concentration  (or  subunit  composition) 
through  changes  in  the  rates  of  its  synthesis  or  degradation. 
Protein  degradation  plays  a  role  in  a  multitude  of  processes, 
including  cell  growth,  division,  differentiation  and 
responses  to  stress.  In  eukaryotes,  a  large  fraction  of 
intracellular  proteolysis  is  mediated  by  the  ubiquitin  sys¬ 
tem.  Ubiquitin  (Ub)  is  a  76-residue  protein  whose  covalent 
conjugation  to  other  proteins  marks  them  for  processive 
degradation  by  the  26S  proteasome — an  ATP-dependent, 
multisubunit  protease  (Jentsch  and  Schlenker,  1995;  Hilt 
and  Wolf,  1996;  Hochstrasser,  1996;  Rubin  et  al,  1997). 


Features  of  proteins  that  confer  metabolic  instability 
are  called  degradation  signals  (degrons).  One  of  the 
degradation  signals  recognized  by  the  ubiquitin  system  is 
called  the  N-degron.  It  comprises  two  essential  determin¬ 
ants:  a  destabilizing  N-terminal  residue  and  an  internal 
lysine  of  a  substrate  (Bachmair  et  al ,  1986;  Varshavsky, 
1996).  The  Lys  residue  is  the  site  of  formation  of  a 
substrate-linked  multiubiquitin  chain  (Bachmair  and 
Varshavsky,  1989;  Chau  et  al ,  1989).  A  set  of  N-degrons 
bearing  different  N-terminal  residues  that  are  destabilizing 
in  a  given  cell  type  yields  a  rule,  called  the  N-end  rule, 
which  relates  the  in  vivo  half-life  of  a  protein  to  the 
identity  of  its  N-terminal  residue.  Similar  but  distinct 
versions  of  the  N-end  rule  operate  in  all  organisms 
examined,  from  mammals  to  fungi  and  bacteria 
(Varshavsky,  1996). 

The  N-end  rule  pathway  is  organized  hierarchically.  In 
eukaryotes  such  as  Saccharomyces  cerevisiae ,  Asn  and 
Gin  are  tertiary  destabilizing  N-terminal  residues  in  that 
they  function  through  their  conversion,  by  the  NTA1- 
encoded  N-terminal  amidase  (Nt-amidase),  into  the 
secondary  destabilizing  residues  Asp  and  Glu  (Baker  and 
Varshavsky,  1995).  Secondary  residues,  in  turn,  function 
through  their  conjugation  to  Arg  by  the  ATE1 -encoded 
Arg-tRNA-protein  transferase  (R-transferase)  (Balzi  et  al , 
1990).  Arg  is  one  of  several  primary  destabilizing 
N-terminal  residues  which  are  bound  directly  by 
N-recognin,  a  225  kDa  E3  protein  encoded  by  the  UBR1 
gene  (Bartel  et  al ,  1990).  Ubrlp,  together  with  the 
associated  ubiquitin-conjugating  (E2)  enzyme  Ubc2p, 
mediates  the  formation  of  a  substrate-linked  multiubiquitin 
chain  (Varshavsky,  1996). 

The  N-end  rule  pathway  was  first  encountered  in  experi¬ 
ments  that  explored,  in  S.cerevisiae ,  the  metabolic  fate  of 
a  fusion  between  Ub  and  a  reporter  such  as  Escherichia 
coli  (3-galactosidase  (P-gal)  (Bachmair  et  al,  1986).  While 
such  engineered  N-end  rule  substrates  have  been  extens¬ 
ively  characterized  (Varshavsky,  1996),  little  is  known 
about  their  physiological  counterparts.  The  few  identified 
so  far  include  the  GPA1 -encoded  Ga  subunit  of  the 
S.cerevisiae  heterotrimeric  G  protein,  which  mediates  the 
pheromone  response  in  this  fungus,  and  RNA  polymerases 
of  alphaviruses  whose  hosts  include  mammalian  and  insect 
cells  (de  Groot  et  al ,  1991;  Madura  and  Varshavsky, 

1994) .  Physiological  functions  of  the  instability  of  these 
proteins  remain  to  be  understood  (Varshavsky,  1996). 
Inactivation  of  the  N-end  rule  pathway  in  S.cerevisiae — 
through  deletion  of  the  UBR1  gene — results  in  cells  which 
grow  slightly  slower  than  their  wild-type  counterparts, 
and  are  impaired  in  sporulation  (increased  frequency  of 
asci  containing  fewer  than  four  spores),  but  otherwise 
appear  to  be  normal  (Bartel  et  al ,  1990). 

Recently,  Becker  and  colleagues  (Alagramam  et  al , 

1995)  have  reported  that  ubrlA  cells  are  deficient  in  the 
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Fig.  1.  The  import  of  peptides  is  decreased  in  the  absence  of  Ubc2p  and  virtually  abolished  in  the  absence  of  Ubc2p  and  Ubc4p.  Sacchawmyces 
cerevisiae  mutants  deficient  in  one  or  more  ubiquitin-conjugating  (E2)  enzymes  or  other  components  of  the  ubiquitin  system  were  tested  for  then 
ability  to  import  peptides,  using  the  halo  assay  and  a  toxic  dipeptide  L-leucyl-L-ethionine  (Leu-Eth)  (see  Materials  and  methods).  +  , 
denote,  respectively,  the  apparently  wild-type,  significantly  reduced,  and  undetectable  levels  of  peptide  import.  A  superscript  1  refers 
carried  a  a  allele  of  an  essential  E2  enzyme,  either  Cdc34P  (Ubc3p)  or  Ubc9p;  the  30°C  temperature  of  the  test  was  semi-pernnssive  forthese 
strains.  (A)  Summary  of  the  results.  (B-F)  Examples  of  the  actual  halo  assays,  with  wild-type  (B),  ptr2 A  (C),  ubrlA  (D),  ubc2 A  (E),  ubc4A  (F)  and 
ubc2A  ubc4A  (G)  strains  of  S.cerevisiae.  Elimination  of  some  E2  enzymes  in  the  ubc2A  background  restored  halo  formation  presumably  because 
such  strains  were  growth-impaired  in  a  way  that  made  them  hypersensitive  to  the  toxicity  of  ethionine  By  contrast,  although  ubc2A  ubc4A  cells 
were  also  growth-impaired,  they  were  import-defective  and  therefore  grew  in  the  immediate  vicinity  of  the  filter. 


import  of  di-  and  tripeptides,  suggesting  that  this  process, 
which  is  universal  among  living  cells,  requires  the  N-end 
rule  pathway.  In  the  present  work,  we  identified  the 
underlying  regulatory  mechanism  and  discovered  a  new 
physiological  substrate  of  the  N-end  rule  pathway,  the 
homeodomain  protein  Cup9p.  This  short-lived  protein 
is  targeted  for  degradation  by  Ubrlp,  and  acts  as  a 
transcriptional  repressor  of  PTR2 ,  a  gene  that  encodes  a 
transmembrane  peptide  transporter.  The  Ubrlp-Cup9p- 
Ptr2p  circuit  is  the  first  example  of  a  physiological  process 
controlled  by  the  N-end  rule  pathway. 

Results 

The  involvement  of  Ubc2p  and  Ubc4p  E2  enzymes 
in  the  control  of  peptide  import 

We  began  by  asking  whether  components  of  the  N-end 
rule  pathway  other  than  Ubrlp  were  also  necessary  for 
the  import  of  peptides.  Previous  work  has  shown  that 
Ubc2p,  one  of  13  ubiquitin-conjugating  (E2)  enzymes  of 
S.cerevisiae ,  is  required  for  the  degradation  of  engineered 
N-end  rule  substrates,  and  is  physically  associated  with 
the  E3  protein  Ubrlp  (N-recognin)  (Jentsch,  1992;  Madura 
etal,  1993). 

To  test  for  the  ability  of  S.cerevisiae  to  import  peptides, 
we  used  a  halo  assay,  in  which  a  filter  soaked  in  the  toxic 
dipeptide  L-leucyl-L-ethionine  (Leu-Eth)  is  placed  on  a 
plate,  inhibiting  the  growth  of  import-competent  cells  near 
the  filter.  By  this  test,  the  elimination  of  UBC2  impaired, 
but  did  not  abolish,  the  import  of  peptides  (Figure  1A,  B 
and  E).  To  determine  which  of  the  other  E2  enzymes,  if 


any,  were  required  for  the  residual  peptide  import  observed 
in  ubc2A  cells,  a  number  of  single  and  multiple  mutants 
in  UBC  genes  were  examined  (Figure  1).  We  found 
that  the  elimination  of  both  UBC2  and  UBC4  virtually 
abolished  the  import  of  peptides  (Figure  1A,  B  and 
E-G).  Elimination  of  Ubc4p,  one  of  the  more  abundant 
E2  enzymes  (Bachmair  et  at,  1986;  Jentsch,  1992),  had 
previously  been  noticed  to  decrease  slightly  the  activity 
of  the  N-end  rule  pathway  (Bartel,  1990).  Cells  lacking 
either  Ntalp  or  Atelp— the  ‘upstream’  components  of  this 
pathway — were  also  examined  and  found  unimpaired  in 
the  import  of  peptides,  in  contrast  to  ubrlA  and  ubc2A 
ubc4A  cells  (Figure  1A). 

identification  of  Cup9p  as  a  negative  regulator  of 
peptide  import 

Previous  work  (Alagramam  et  al. ,  1995)  has  shown  that 
deletion  of  UBR1  greatly  reduces  the  level  of  PTR2  mRNA, 
which  encodes  the  transmembrane  peptide  transporter.  This 
result,  and  our  observation  that  peptide  import  requires 
the  presence  of  at  least  one  of  two  specific  ubiquitin- 
conjugating  enzymes  (Figure  1),  suggested  a  model  in 
which  expression  of  the  Ptr2p  transporter  is  regulated  by 
a  short-lived  repressor  that  is  degraded  by  the  N-end  rule 
pathway.  In  cells  lacking  UBR1 ,  the  repressor  would  be 
expected  to  accumulate,  thereby  blocking  peptide  import. 
One  prediction  of  this  model  is  that  inactivation  of  this 
repressor  would  bypass  the  requirement  for  Ubrlp  in 
peptide  import.  A  screen  for  such  ‘bypass’  mutations  (see 
Materials  and  methods)  yielded  199  recessive  isolates,  of 
which  101  defined  one  complementation  group,  termed 
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Fig.  2.  Cup9p  is  a  repressor  of  the  PTR2  gene.  (A)  Expression  of  the  S.cerevisiae  PTR2  gene  in  different  genetic  backgrounds.  Equal  amounts  of 
total  RNA  isolated  from  different  strains  were  analyzed  by  Northern  hybridization  (see  Materials  and  methods),  using  the  PTR2  (peptide  transporter) 
and  ACT 1  (actin)  genes  as  32P-labeled  probes.  Lane  a,  JD52  ( CUP9  UBR1)  (wild-type)  transformed  with  pCB201  (empty  vector).  Lane  b,  JD55 
{CUP9  ubrlA)  transformed  with  pCB201.  Lane  c,  JD52  ( CUP9  UBR1)  transformed  with  the  high-copy  plasmid  pCB209  that  expressed  CUP9  from 
its  natural  promoter.  Lane  d,  CBY18  ( cup9A  UBR1).  Lane  e,  CBY16  {cup9A  ubrlA).  Lane  f,  CBY21  ( CUP9  UBR1  ptr2A).  Lanes  g-i,  same  as  lanes 
a-c,  but  a  longer  autoradiographic  exposure  to  highlight  the  tight  repression  of  PTR2  by  Cup9p  in  ubrl  A  cells  (b,  h)  and  the  difference  between 
levels  of  PTR2  mRNA  in  the  wild-type  cells  (a,  g)  and  their  counterparts  that  overexpressed  Cup9p  (c,  i).  (B)  Cup9p  specifically  binds  to  a  site  in 
the  PPTR2  promoter.  A  gel  shift  assay  with  Cup9-H6,  poly-dl-dC  and  32P-labeled  DNA  fragments  of  the  PTR2  promoter  hybridization  (see  Materials 
and  methods).  Lanes  a-c,  with  a  fragment  (-1  to  -447)  proximal  to  the  inferred  start  codon  of  the  PTR2  ORF.  Lanes  d-f,  same  as  lanes  a-c,  but 
with  a  more  distal  DNA  fragment  {~AA 8  to  -897).  Concentrations  of  Cup9-H6  (in  fig/ml)  are  indicated  above  the  lanes. 


subl  (suppressor  of  a  block  to  peptide  import  in  ubrlA). 
In  agreement  with  the  model’s  prediction,  subl  mutants 
acquired  the  ability  to  express  PTR2  in  the  absence  of 
UBR1  (Figure  2A,  lane  b  versus  lane  e).  The  subl 
locus  was  cloned  by  complementation  (see  Materials  and 
methods),  and  was  found  to  be  the  CUP9  gene. 

CUP9  was  originally  identified  by  Knight  et  al  (1994) 
as  a  gene  whose  disruption  impairs  the  copper  resistance 
of  S.cerevisiae  growing  on  lactate,  a  non-fermentable 
carbon  source.  Under  these  conditions,  Cup9p  plays  a 
major  (but  mechanistically  obscure)  role  in  copper 
homeostasis  (Knight  et  al ,  1994).  CUP9  encodes  a  35  kDa 
protein  that  contains  a  homeodomain,  an  -60-residue 
helix-turn-helix  DNA-binding  motif  present  in  many 
eukaryotic  regulatory  proteins  (Wolberger,  1996).  Outside 
the  homeodomain  region,  the  sequence  of  Cup9p  is  not 
similar  to  sequences  in  databases. 

To  verify  that  CUP9  and  SUB1  were  the  same  gene, 
complementation  tests  were  carried  out.  Two  independ¬ 
ently  derived  ubrlA  cup9::LEU2  strains  (CBY16  and 
CBY17)  were  crossed  to  ubrlA  subl-1  (CBY15),  and  the 
resulting  diploids  (CBY23  and  CB  Y24,  respectively)  were 
tested  for  their  ability  to  import  dipeptides  (see  Materials 
and  methods).  As  would  be  expected  of  allelic  loci, 
cup9::LEU2  and  subl-1  failed  to  complement  one  another: 
both  diploid  strains  remained  import-competent  (data 
not  shown).  In  another  test,  CBY23  and  CBY24  were 
sporulated,  and  the  segregants  were  analyzed  for  the 
presence  of  the  LEU2  gene  (integrated  at  the  CUP9  locus) 
and  for  the  ability  to  import  peptides.  Among  the  eight 
tetrads  tested,  LEU2  was  present  in  two  of  the  four 
segregants,  whereas  all  four  segregants  were  invariably 
import-competent,  a  pattern  expected  if  CUP9  and  SUB1 
were  one  and  the  same  gene. 


To  examine  the  regulation  of  peptide  import  by  Cup9p 
and  Ubrlp,  congenic  S.cerevisiae  strains  that  lacked, 
expressed  or  overexpressed  Cup9p  and/or  Ubrlp  were  con¬ 
structed  and  assayed  for  peptide  import  by  growth  on  select¬ 
ive  media.  Suspensions  of  cells  (auxotrophic  for  lysine) 
were  serially  diluted  and  plated  on  either  rich  media,  min¬ 
imal  media  lacking  lysine  and  containing  Lys- Ala  dipeptide 
(selecting  for  peptide  import),  or  minimal  media  containing 
the  toxic  dipeptide  Leu-Eth  (selecting  against  peptide 
import).  All  strains  grew  at  comparable  rates  on  rich  media 
(Figure  3A).  On  minimal  media  that  supplied  the  essential 
lysine  as  the  Lys-Ala  dipeptide,  the  CUP9  UBR1  (wild- 
type),  cup9A  UBR1  and  cup9A  ubrlA  strains  grew  at  com¬ 
parable  rates  (Figure  3B),  whereas  the  CUP9  ubrlA  strain 
failed  to  grow  (Figure  3B),  in  agreement  with  the  observa¬ 
tion  that  UBR1  is  required  for  peptide  import  (Figure  1). 
Opposite  growth  patterns  were  observed  on  media  con¬ 
taining  toxic  dipeptide  (Figure  3C).  Comparison  of  the  data 
in  Figure  3  with  PTR2  mRNA  levels  (Figure  2A)  suggested 
that  CUP9  exerts  its  effect  on  the  import  of  peptides  by 
repressing  transcription  of  PTR2.  For  example,  wild-type 
(CUP9  UBR1)  cells  overexpressing  Cup9p  exhibited 
reduced  levels  of  PTR2  mRNA  (Figure  2A)  and  decreased 
sensitivity  to  toxic  dipeptide  (Figure  3C),  whereas  strains 
lacking  CUP9  ( cup9A  UBR1  and  cup9A  ubrlA)  overex¬ 
pressed  PTR2  (Figure  2A)  and  were  hypersensitive  to  toxic 
dipeptide  (Figure  3C). 

Cup9p  is  a  repressor  of  the  PTR2  gene 

To  address  the  mechanism  of  repression  by  Cup9p,  we 
asked  whether  purified  Cup9p  (see  Materials  and  methods) 
could  selectively  bind  to  specific  regions  of  the  PTR2 
promoter.  Gel  shift  assays  in  the  presence  of  poly-dl-dC 
competitor  DNA  were  performed  with  labeled  DNA 
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Fig.  3.  Relative  capacity  for  peptide  import  in  congenic  S.cerevisiae 
strains  that  contained,  lacked  or  overexpressed  Cup9p  and/or  Ubrlp. 
Serial  dilutions  of  the  indicated  strains  were  deposited  by  a  48-pin 
applicator  onto  either  rich  (YPD)  medium  (A),  minimal  medium 
containing  66  pM  Lys-Ala  dipeptide  as  the  sole  source  of  lysine  (B), 
or  minimal  medium  containing  both  lysine  (at  110  pM)  and  the  toxic 
dipeptide  Leu-Eth  (at  55  pM)  (C).  The  plates  were  incubated  at  30°C 
for  1-2  days.  The  relevant  genetic  loci  are  shown  on  the  left.  The 
'UBRL  overexpressed  CUP9 ’  strain  (JD52-2p-CUP9)  carried  the  high- 
copy  plasmid  pCB209  that  expressed  Cup9p  from  its  natural  promoter. 


fragments  of  the  PTR2  promoter  and  His6-tagged  Cup9p 
(Cup9p-H6)  purified  from  E.coli .  Cup9p-H6  bound  to  a 
site  within  a  distal  region  of  the  PTR2  promoter  (positions 
-448  to  “897  relative  to  the  inferred  CUP9  start  codon), 
but  did  not  bind  to  the  proximal  region  of  the  PTR2 
promoter  (positions  —1  to  —447)  under  the  same  condi¬ 
tions  (Figure  2B). 

The  transcriptional  repressor  function  of  Cup9p  was 
further  suggested  by  the  finding  that  the  co-repressor 
complex  Tuplp/Ssn6p  plays  a  role  in  the  control  of  peptide 
import.  The  Tuplp/Ssn6p  complex  inhibits  transcription 
of  many  yeast  genes  through  interactions  with  gene- 
specific  DNA-binding  repressors  such  as  Mata2p  (Chen 
et  al ,  1993;  Smith  et  al ,  1995),  a  homeodomain  homolog 
of  Cup9p.  We  found  that  most  of  our  sub  mutants  that 
were  not  CUP9  mutants  could  be  complemented  by  low- 
copy  plasmids  bearing  SSN6  or  TUP1  (G.Tumer,  S.Saha 
and  A. Varshavsky,  unpublished  data).  In  addition,  deletion 
of  SSN6 ,  like  deletion  of  CUP9,  restored  the  ability  of 
ubrIA  cells  to  import  peptides  (data  not  shown). 

Ubr Ip-dependent  degradation  of  Cup9p 

The  fact  that  deletion  of  CUP9  renders  PTR2  transcription 
independent  of  Ubrlp  (Figure  2 A),  and  the  observation 
that  overexpression  of  Ubrlp  enhances  the  import  of 
peptides  in  Cup9p-expressing  strains  (data  not  shown) 
suggested  that  the  N-end  rule  pathway  regulates  peptide 


import  by  targeting  Cup9p  for  degradation.  To  test  this 
conjecture,  we  carried  out  pulse-chase  experiments  with* 
a  C-terminally  FLAG-tagged  Cup9p  in  UBR1  and  ubrIA 
cells.  Cup9p-FLAG  was  a  very  short-lived  protein  ( tm 
-5  min)  in  UBR1  cells  (Figure  4).  By  contrast,  Cup9p 
was  much  longer-lived  (tm  >30  min)  in  ubrIA  cells 
(Figure  4).  Degradation  of  Cup9p  was  also  found  to 
depend  upon  UBC2  and  UBC4  (data  not  shown),  in 
agreement  with  the  observation  that  a  ubc2A  ubc4A  double 
mutant  failed  to  import  dipeptides  (Figure  1G).  The 
residual  instability  of  Cup9p  in  ubrIA  cells  (Figure  4) 
suggested  the  presence  of  a  second,  Ubrlp-independent 
degron;  this  pattern  is  reminiscent  of  another  homeo¬ 
domain  repressor,  Mata2p,  which  also  contains  at  least  two 
distinct  degradation  signals  (Hochstrasser  and  Varshavsky, 
1990;  Chen  et  al ,  1993). 

Ubrlp  recognizes  engineered  N-end  rule  substrates 
through  their  destabilizing  N-terminal  residues 
(Varshavsky,  1996).  To  determine  the  N-terminal  residue 
of  Cup9p,  we  overexpressed  and  purified  Cup9p-FLAG 
from  ubrIA  S.cerevisiae  and  subjected  it  to  N-terminal 
sequencing.  Cup9p-FLAG  was  found  to  have  a  blocked 
(presumably  acetylated)  N-terminus  (see  Materials  and 
methods),  suggesting  that  Ubrlp  targets  Cup9p  through  a 
degron  distinct  from  the  N-degron.  Independent  evidence 
for  this  conjecture  was  produced  through  the  analysis  of 
GST-Cup9p-ha2,  a  fusion  of  glutathione  transferase  (GST) 
and  C-terminally  ha-tagged  Cup9p.  Pulse-chase  analysis 
of  GST-Cup9p-ha2  revealed  that  the  fusion  protein  was 
nearly  as  short-lived  in  vivo  as  the  N-terminally  unmodified 
Cup9p-FLAG  (Figure  4,  lanes  b-e;  compare  with  Figure 
5,  lanes  b-e).  If  Cup9p  were  targeted  for  processive 
degradation  through  a  destabilizing  N-terminal  residue,  a 
preliminary  proteolytic  cleavage(s)  of  Cup9p  would  be 
required  to  expose  such  a  residue  (Varshavsky,  1996).  In 
the  case  of  GST-Cup9p-ha2,  this  cleavage  would  generate 
a  proteolytic  fragment  consisting  of  GST  and  an  N-terminal 
portion  of  Cup9p  preceding  the  cleavage  site.  Since  free 
GST  has  been  found  to  be  long-lived  when  expressed  in 
yeast  (data  not  shown),  accumulation  of  such  a  proteolytic 
fragment  would  be  expected  to  accompany  the  degradation 
of  GST-Cup9p-ha2. 

Pulse-chase  analysis  of  GST-Cup9p-ha2,  using  gluta¬ 
thione-agarose  beads  to  isolate  GST-containing  proteins 
(see  Materials  and  methods),  indicated  that  the  degradation 
of  GST-Cup9p-ha2  by  the  N-end  rule  pathway  was  not 
accompanied  by  the  appearance  of  a  fragment  containing 
the  N-terminal  GST  moiety  (Figure  5).  This^  finding 
strongly  suggested  that  Cup9p  bears  an  ‘internal*  degron 
recognized  by  Ubrlp.  Additional  support  for  this  conjec¬ 
ture  was  provided  by  truncation  analysis  of  a  Cup9p- 
DHFR-myc  fusion  protein.  These  experiments  indicated 
that  the  N-terminal  81  residues  of  the  306-residue  Cup9p 
are  dispensable  for  its  Ubrlp- dependent  degradation, 
strongly  suggesting  that  the  relevant  degradation  signal 
resides  in  the  C-terminal  two-thirds  of  Cup9p  (C.Byrd, 
I. Davydov  and  A. Varshavsky,  unpublished  data).  Although 
these  results  are  fully  consistent  with  the  presence  of  an 
internal  Ubrlp-dependent  degron  in  Cup9p,  there  remains 
the  less  parsimonious  possibility  that  Cup9p  is  degraded 
via  m^w-targeting  (Johnson  et  al ,  1990).  In  this  process, 
the  two  determinants  of  the  N-degron  (a  destabilizing 
N-terminal  residue  and  a  ubiquitin-accepting  internal  Lys 
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Fig.  4.  Degradation  of  Cup9p  by  the  N-end  rule  pathway.  Saccharomyces  cerevisiae  strains  carrying  pCB210,  a  low-copy  plasmid  that  expressed 
Cup9p-FLAG  from  the  P CUP9  promoter,  were  labeled  at  30°C  with  [35S]methionine/cysteine  for  5  min,  followed  by  a  chase  for  0,  5,  15,  30  and 
60  min,  preparation  of  extracts,  immunoprecipitation  of  Cup9p-FLAG  with  a  monoclonal  anti-FLAG  antibody,  SDS-PAGE  and  autoradiography/ 
quantitation  (see  Materials  and  methods).  Lane  a,  JD55  (ubrl A)  cells  that  carried  pCB200  (vector  alone).  Lanes  b-f,  JD52  ( UBR1 )  (wild-type)  cells 
that  carried  pCB210,  expressing  Cup9p-FLAG.  Lanes  g-k,  same  as  lanes  b-f,  but  with  JD55  (ubrIA)  cells. 


residue)  reside  in  two  different  polypeptides,  whose  inter¬ 
action  yields  an  active  N-degron,  leading  to  ubiquitinyl- 
ation  and  degradation  of  the  polypeptide  bearing  the  Lys 
residue  (Johnson  et  al ,  1990).  At  present,  little  is  known 
about  the  protein  ligands  of  Cup9p,  save  for  the  likely 
possibility  that  Cup9p  interacts  with  the  Tuplp/Ssn6p 
repressor  complex  (see  above),  similarly  to  the  previously 
established  interaction  of  this  complex  with  Mata2,  a 
homeodomain-containing  homolog  of  Cup9p  (Smith 
et  al,  1995). 

Discussion 

The  discovery  that  Ubrlp  controls  the  import  of  peptides 
through  degradation  of  the  Cup9p  repressor  (Figure  6) 
identifies  the  first  clear  physiological  function  of  the  N-end 
rule  pathway.  The  existence  of  mammalian,  plant  and 
bacterial  homologs  of  the  yeast  Ptr2p  transporter  suggests 
that  the  import  of  peptides  in  these  organisms  may  also 
be  regulated  by  the  N-end  rule  pathway. 

Why  was  this  pathway,  rather  than  another  Ub-depend- 
ent  proteolytic  system,  recruited  in  the  course  of  evolution 
for  the  regulation  of  peptide  import?  A  plausible  explan¬ 
ation  is  suggested  by  the  ability  of  Ubrlp  to  bind  peptides 
bearing  destabilizing  N-terminal  residues  (Varshavsky, 
1996).  Since  more  than  half  of  the  20  amino  acids  are 
destabilizing  in  the  yeast  and  mammalian  N-end  rules 
(Varshavsky,  1996),  a  significant  fraction  of  imported 
peptides  would  be  expected  to  compete  with  Cup9p  for 
binding  to  Ubrlp.  This  competition  may  decrease  the  rate 
of  Cup9p  degradation.  [Dipeptides  added  to  a  culture  of 
S.  cerevisiae  have  been  shown  to  inhibit  the  N-end  rule 
pathway  (Baker  and  Varshavsky,  1991).]  The  ensuing 
increase  in  the  level  of  Cup9p  repressor  would  in  turn 
decrease  the  production  of  the  Ptr2p  transporter,  diminish¬ 
ing  the  rate  of  peptide  import.  This  ‘peptide-sensing* 
negative  feedback  loop  would  maintain  the  intracellular 
concentration  of  short  peptides  within  a  predetermined 
range — potentially  a  useful  feature,  made  possible  by  the 
substrate-binding  properties  of  Ubrlp.  Experiments  to 
verify  this  model  are  under  way. 

Another  interesting  aspect  of  the  Ubrlp-Cup9p-Ptr2p 


wild-type  ubrIA 

Chase  (min):  0  5  10  30  0  5  10  30 


abode  f  g  h  i 


Fig.  5.  Degradation  of  GST-Cup9p-ha2  is  not  preceded  by  a  processing 
that  releases  a  GST-containing  fragment.  Saccharomyces  cerevisiae 
strains  carrying  pCB120,  a  low-copy  plasmid  that  expressed  GST- 
Cup9p-ha2  from  the  VGAU  promoter,  were  labeled  at  30°C  with 
[35S]  methionine/cysteine  for  5  min,  followed  by  a  chase  for  0,  5,  10 
and  30  min,  preparation  of  extracts,  isolation  of  GST-Cup9p-ha2  on 
glutathione-agarose  beads,  SDS-PAGE  and  autoradiography/ 
quantitation  (see  Materials  and  methods).  Lanes  b-e,  JD52  ( XJBR1 ) 
(wild-type)  cells  expressing  GST-Cup9p-ha2.  Lanes  f-i,  same  as  lanes 
b-e,  but  with  JD55  (ubrIA)  cells.  Lane  a,  molecular  mass  markers 
(in  kDa). 

regulatory  circuit  is  its  possible  relevance  to  copper 
homeostasis.  In  addition  to  functioning  as  a  repressor  of 
peptide  transport,  Cup9p  also  contributes  to  the  resistance 
of  S.  cerevisiae  to  copper  toxicity  during  growth  on  lactate, 
a  non-fermentable  carbon  source  (Knight  et  al,  1994). 
Copper  homeostasis  is  mediated  by  a  complex  set  of 
circuits,  some  of  which  also  regulate  iron  metabolism 
(Zhou  and  Thiele,  1993;  Ooi  et  al,  1996).  The  double 
role  of  Cup9p  in  regulating  both  peptide  import  and  copper 
homeostasis  may  signify  a  physiological  connection 
between  these  seemingly  unrelated  processes. 
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Fig.  6.  A  model  for  regulation  of  peptide  import  in  S.cerevisiae.  The 
expression  of  PTR2,  which  encodes  a  transmembrane  peptide 
transporter,  is  regulated  by  the  short-lived,  homeodomain-containing 
transcriptional  repressor  Cup9p.  The  concentration  of  Cup9p  is 
controlled  in  part  through  its  degradation  by  the  N-end  rule  pathway, 
whose  targeting  components  include  Ubrlp  and  either  Ubc2p  or 
Ubc4p  (see  the  main  text). 


Our  work  adds  Cup9p  to  the  list  of  short-lived  regulatory 
proteins  whose  degradation  is  mediated  by  the  ubiquitin 
system.  Many  of  these  proteins  are  negative  regulators.  For 
example,  Mata2p,  a  homeodomain-containing  homolog  of 
Cup9p,  controls  the  mating  type  of  S.cerevisiae  (a  or  a) 
through  repression  of  a-specific  genes  (Herskowitz,  1989). 
Mata2p  appears  to  be  constitutively  short-lived  in  haploid 
cells  (Hochstrasser  and  Varshavsky,  1990).  Therefore, 
cessation  of  Mata2p  synthesis  upon  the  conversion  of  an 
a  cell  into  an  a  cell  results  in  rapid  disappearance 
of  Mata2p  and  the  establishment  of  a-specific  circuits 
(Hochstrassser,  1996).  Progression  of  the  cell  cycle  is  also 
controlled  by  short-lived  negative  regulators,  in  particular 
by  Siclp,  an  inhibitor  of  CDK,  the  cyclin-dependent 
kinase  (Schwob  et  al ,  1994).  In  this  case,  however,  a 
rapid  ‘on-switch’  is  based  on  the  phosphorylation-induced 
degradation  of  the  previously  stable  Siclp  (King  et  al, 
1996).  Whether  the  Cup9p  repressor  (Figure  6)  is  constitu¬ 
tively  short-lived  or  whether  its  stability  is  regulated  by 
external  conditions  such  as,  for  example,  different  nitrogen 
sources,  remains  to  be  determined. 

The  finding  that  Cup9p  lacks  a  destabilizing  N-terminal 
residue  indicates  that  Ubrlp,  the  recognition  component 
of  the  N-end  rule  pathway,  is  able  to  target  substrates 
bearing  either  internal  degradation  signals  or  N-degrons. 
The  Ga  subunit  of  the  G  protein — the  other  known 
physiological  substrate  of  Ubrlp  in  S.cerevisiae  also 
lacks  a  destabilizing  N-terminal  residue  (Madura  and 
Varshavsky,  1994).  Thus,  a  substantial  fraction  of  naturally 
short-lived  proteins  targeted  by  the  N-end  rule  pathway 
may  bear  internal  degrons  rather  than  N-degrons,  a  pos¬ 
sibility  that  the  identification  of  other  physiological  sub¬ 
strates  of  this  pathway  will  address. 

Materials  and  methods 

Strains ,  media  and  genetic  techniques 

Saccharomyces  cerevisiae  strains  were  grown  in  rich  (YPD)  medium 
containing  2%  peptone,  1%  yeast  extract  and  2%  glucose  or  in  synthetic 
yeast  media,  containing  0.67%  yeast  nitrogen  base  without  amino  acids 
(Difco),  auxotrophic  nutrients  at  concentrations  specified  by  Sherman 
et  al.  (1986)  and  either  2%  glucose  (SD  medium),  2%  raffinose  (SM- 
raffinose  medium)  or  2%  galactose  (SM-galactose  medium)  as  carbon 
sources.  SHM-glucose  medium  used  in  halo  assays  was  prepared 
according  to  Island  et  al.  (1987)  and  was  identical  to  SD  medium  except 
that  it  lacked  methionine  and  contained  allantoin  (1  mg/ml)  and  yeast 
nitrogen  base  (Difco,  1.7  mg/ml)  without  amino  acids  and  ammonium 


sulfate.  Escherichia  coli  strain  DH5a  was  used  for  plasmid  propagation 
and  cloning  steps.  For  induction  of  the  ?GAL}  promoter,  cells  were  grown  # 
to  an  Am  of  0.5-1  in  SM-raffinose  medium,  pelleted  and  transferred 
to  SM-galactose  medium  for  3  h. 

Halo  and  dilution  assays 

Peptide  import  was  assayed  using  the  halo  and  dilution  methods.  For 
halo  assays  (Island  et  al. ,  1987),  cells  were  grown  to  an  A600  of  -1  in 
SHM-glucose  medium  with  auxotrophic  supplements.  Cells  were  pelleted 
by  centrifugation  and  resuspended  in  water  to  -5X107  cells/ml.  Then 
0.1  ml  of  cell  suspension  was  mixed  with  10  ml  of  0.8%  noble  agar 
(Difco)  at  55°C,  and  spread  on  plates  containing  20  ml  of  SHM  medium. 
Sterile  filter  disks  containing  the  toxic  dipeptide  L-leucyl-L-ethionine 
(Leu-Eth;  5  jimol)  were  placed  in  the  middle  of  each  plate,  followed  by 
incubation  at  30°C  for  1-2  days. 

For  dilution  assays,  strains  were  grown  (under  selection  for  plasmids) 
in  SD  medium  to  an  Agoo  of  ~1.  Cells  from  each  sample  (1.5  X 10  )  were 
spun  down  and  resuspended  in  1  ml  of  water.  The  samples  were  serially 
diluted  by  4-fold  in  a  microtiter  plate  (150  gl/well)  to  generate  eight 
different  initial  concentrations  of  cells  that  ranged  from  1.5X10  to  915 
cells/ml.  The  suspensions  were  spotted  to  various  media,  using  a  48-pin 
applicator.  The  plates  were  incubated  at  30°C  for  1-2  days. 

Leu-Eth  was  synthesized  using  standard  methods  of  organic  chemistry. 
Briefly,  L-ethionine  methyl  ester  (Eth-OMe)  was  produced  from 
L-ethionine  and  methanol.  Eth-OMe  was  then  coupled  with  N-tert - 
butoxy-c  arbony  1  -L-leucine-p-nitrophenyl  e  ster  (N-  f-B  OC-L-leucine- 
PNP),  yielding  N-r-BOC-L-leucyl-L-ethionine  methyl  ester,  which  was 
purified  by  flash  chromatography  on  a  silica  column.  N-r-BOC-L-leucyl- 
L-ethionine  methyl  ester  was  then  converted  into  N-r-BOC-L-leucyl-L- 
ethionine  by  treatment  with  KOH.  N-r-BOC-L-leucyl-L-ethionine  was 
deprotected  with  trifluoroacetic  acid,  yielding  Leu-Eth. 

A  screen  for  import-competent  mutants  in  the  ubrIA 
background 

Thirty  5-ml  cultures  of  JD83-1A  {leu2-3  ubrIA),  auxotrophic  for  leucine, 
were  grown  to  an  A^qq  of  —  1,  and  — 1.5X107  cells  from  each  culture 
were  plated  onto  SD  medium  lacking  leucine  and  histidine  but  containing 
the  leucyl-histidine  dipeptide  at  0.23  jUM.  Plates  were  incubated  at  30°C 
for  2  days,  selecting  for  mutants  able  to  grow  on  this  medium.  Of  the 
320  sub  mutants  (‘suppressors  of  a  block  to  peptide  import  in  ubrIA') 
thus  obtained,  199  were  found  by  complementation  tests  to  be  clearly 
recessive.  Among  these,  101  mutants  belonged  to  one  complementation 
group,  termed  subl. 

Isolation  of  the  CUP3  (SUB1)  gene 

A  50  ml  culture  of  S.cerevisiae  CBY15  ( ubrIA  subl-1 )  was  grown  to 
an  A600  of  -1  in  SHM-glucose.  Cells  were  made  competent  with  lithium 
acetate  (Ausubel  et  al ,  1992),  and  25  0.1-ml  samples  (-7X10  cells/ 
ml)  were  transformed  with  a  yeast  genomic  DNA  library  (American 
Type  Culture  Collection;  #77164)  in  the  TRP1,  CEN6- based  vector 
pRS200  (Sikorski  and  Hieter,  1989).  The  cells  were  pelleted,  and  each 
sample  was  resuspended  in  1.5  ml  of  SHM-glucose  and  incubated  at 
30°C  for  3  h.  The  cells  were  pelleted  again,  each  sample  was  resuspended 
in  0.1  ml  of  water,  added  to  10  ml  of  0.8%  Noble  agar  at  55  C,  and 
spread  onto  SHM-glucose  plates  lacking  Trp  and  containing  Leu-Eth  at 
37  jiM.  Of  the  -2.5  X104  Trp+  transformants  plated,  14  could  grow  in 
the  presence  of  Leu-Eth.  Of  these,  two  yielded  the  library- derived 
plasmids,  pSUBl-1  and  pSUBl-6,  that  complemented  the  subl  mutation 
of  CBY15.  The  insert  of  pSUBl-6  contained  the  insert  of  pSUBl-1. 
Partial  sequencing  of  the  —8.2  kb  insert  of  pSUBl-1  identified  it  as  a 
region  of  the  S.cerevisiae  chromosome  XVI.  Deletion  analysis  (not 
shown)  localized  the  complementing  activity  to  an  —2.8  kb  Hindlll— 
Kpnl  fragment,  whose  sequence  revealed  the  presence  of  two  ORFs 
(. CUP9  and  SCYPL178W).  Further  deletion  analysis  (not  shown)  local¬ 
ized  the  complementing  activity  to  the  previously  isolated  (Knight  et  al. , 
1994)  CUP9  gene.  Verification  that  CUP9  and  SUB1  were  one  and  there 
same  gene  was  carried  out  as  described  in  Results. 

Construction  of  null  cup9  mutants 

The  -2.8  kb  Hindlll-Kpnl  genomic  DNA  fragment  containing  CUP9 
was  ligated  to  Hindlll-Kpnl-cnt  pRS306ASpel,  yielding  pCB117. 
pRS306ASpel  was  derived  from  the  URA3- bearing  pRS306  (Sikorski 
and  Hieter,  1989)  through  elimination  of  its  Spe I  site.  The  CXJP9  ORF 
was  disrupted  by  inserting  an  -2  kb,  blunt-ended,  LEU2-C ontaining  Sail 
fragment  from  pJJ283  into  the  Spe  I  site  of  CUP9,  yielding  pCB119,  in 
which  the  LEU2  and  CUP9  ORFs  were  oriented  in  opposite  directions. 
An  -5  kb  //mdlll-PvuII  fragment  of  pCB  119  containing  the  cup9::LEU2 
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Table  I.  Saccharomyces  cerevisiae  strains  used  in  this  study 


‘Strain  Genotype 


References 


DF5 

EJ1 

EJ2 

EJ3 

EJ10 

EJY102 

EJY105 

EJY106 

MHY495 

MHY503 

MHY507 

MHY550 

MHY552 

MHY554 

MHY557 

MHY570 

MHY599 

MHY601 

YW055 

YPH500 

BBY67 

KM207-1 

JD34 

JD51 

JD52 

JD53 

JD55 

JD83-1A 

RJD549 

RJD795 

SGY6 

CBY4 

CBY5 

CBY6 

CBY7 

CBY8 

CBY9 

CBY10 

CBY11 

CBY15 

CBY16 

CBY17 

CBY19 

CBY23 

CBY24 


MATalMATa  trpl-lArpl-1  ura3-52/ura3-52  his3-A200/his3-A200  leu2-3,112Aeu2-3,112  lys2-801/ 
lys2-801  gal/gal 

MATa  ubc5A::LEU2  trpl-l  ura3-52  his3-A200  Ieu2-3,U2  lys2-801  gal 

MATa  ubcl A::URA3  trpl-l  ura3-52  his3-A200  leu2-3y112  lys2-801  gal 

MATa  ubc4A::HIS3  trpl-l  ura3-52  his3-A200  Ieu2-3J12  lys2-801  gal 

MATa  ubc4A::HIS3  ubc5A:;LEU2  trpl-l  ura3-52  his3-A200  leu2-3f112  lys2-801  gal 

MATa  ubc2A::LEU2  trpl-l  ura3-52  his3-A200  Ieu2-3J12  lys2-801  gal 

MATh  ubc2A::LEU2  ubc4A::HlS3  trpl-l  ura3-52  his3-A200  Ieu2-3J12  lys2-801  gal 

MATa  ubc2A::LEU2  ubc4A::HIS3  trpl-l  ura3-52  his3-A200  Ieu2-3J12  lys2-801  gal 

MATa  ubc6-Al::HIS3  trpl-l  ura3-52  his3-A200  Ieu2-3J12  lys2-801 

MATa  ubc4-Al::HIS3  ubc6-Al::HlS3  trpl-l  ura3-52  his3-A200  leu2-3,112  lys2-801 

MATa  ubc7::LEU2  trpl-l  ura3-52  his3-A200  leu2-3,112  lys2-801 

MATa  ubc4-A2::TRPl  ubc5-Al::LEU2  ubc6-Al::HIS3  trpl-l  ura3-52  his3-A200  Ieu2-3J12 
lys2-801 

MATa  ubc6-Al::HlS3  ubc7::LEU2  trpl-l  ura3-52  his3-A200  Ieu2-3J12  lys2-801 

MATa  ubc4-Al::HIS3  ubc7::LEU2  trpl-l  ura3-52  his3-A200  Ieu2-3J12  lys2-801 

MATa  ubc4-Al::HIS3  ubc6-Al::HIS3  ubc7::LEU2  trpl-l  ura3-52  his3-A200  leu2-3,112  lys2-801 

MATa  ubc4-A2::TRPl  ubc5-Al::LEU2  ubc6-Al ::HIS3  ubc7::LEU2  trpl-l  ura3-52  his3-A200 

leu2-3,112  lys2-801 

MATa  pas2  trp  ura3-52  adel  leu2-3 

MATa  ubc8::URA3  trpl-l  ura3-l  ade2-l  his3-ll  Ieu2-3J12  canl-100 

MATa  ubc9-l  trpl-l  ura3-52  his3-A200  leu2-3,112  lys2-801 

MATa  trpl-A63  ura3-52  ade2-101  his3-A200  Ieu2-Al  Iys2-8Q1 

MATa  ubc2A::LEU2  trpl-A63  ura3-52  ade2-101  his3-A200  Ieu2-Al  lys2-801 

MATa  atelA::TRPl  trpl-A63  ura3-52  ade2-101  his3-A200  Ieu2-Al  lys2-801 

MATa  ubc2A::URA3  trpl-A63  ura3-52  ade2-101  his3-A200  Ieu2-Al  lys2-801 

MATsdMATa  trpl-A63/trpl-A63  ura3-52/ura3-52  his3-A200/his3-A200  leu2-3J12Aeu2-3,112 

lys2-801Ays2-801 

MATa  trpl  -A63  ura3-52  his3-A200  Ieu2-3J12  lys2-801 

MATa  trpl-A63  ura3-52  his3-A200  Ieu2-3J12  lys2-801 

MATa  ubrlA::HIS3  trpl-A63  ura3-52  his3-A200  Ieu2-3J12  lys2-801 

MATa  ubrlA::HlS3  trpl-A63  ura3-52  his3-A200  leu2-3,112  lys2-801 

MATa  cdc34-l  trpl  ura3-52  leu2-3 

MATa  cdc34-2  trpl  ura3-52  leu2-3 

MATa  ntalA::TRPl  trpl-A63  ura3-52  ade2-101  his3-A200  Ieu2-3J12  lys2-801 
MATa  ubc2A::URA3  ubc5A::LEU2  trpl-l  ura3-52  his3-A200  Ieu2-3J12  lys2-801  gal 
MATa  ubc2A::LEU2  ubc6-Al : :HIS3  trpl-l  ura3-52  his3-A200  leu2-3,112  lys2-801 
MATa  ubc2A::URA3  ubc7::LEU2  trpl-l  ura3-52  his3-A200  Ieu2-3J12  lys2-801 
MATa  ubc2A::LEU2  ubc8::URA3  trpl-l  ura3-l  ade2-l  his3-ll  Ieu2-3J12  canl-100 
MATa  ubc2A::LEU2  ubclA::URA3  trpl-l  ura3-52  his3-A200  leu2-3,112  lys2-801  gal 
MATa  ubc2A::LEU2  ubc9-l  trpl-l  ura3-52  his3-A200  Ieu2-3J12  lys2-801 
MATa  ubc2A::LEU2  pas2  trp  ura3-52  adel  leu2-3 
MATa  ubc2A::LEU2  cdc34-l  trpl  ura3-52  leu2-3 

MATa  subl-1  ubrlA::HlS3  trpl-A63  ura3-52  his3-A200  Ieu2-3J12  lys2-801 
MATa  cup9::LEU2  ubrlA::HIS3  trpl-A63  ura3-52  his3-A200  leu2-3,112  lys2-801 
MATa  cup9::LEU2  ubrlA::HIS3  trpl-A63  ura3-52  his3-A200  leu2-3,112  lys2-801 
MATa  cup9::LEU2  trpl-A63  ura3-52  his3-A200  Ieu2-3t112  lys2-801 

MATalMATa  subl-l/cup9A::LEU2  ubrl A::HlS3/ubrlA:  :H1S3  trpl-A63Arpl-A63  ura3-52/ura3-52 
his3-A200Aiis3-A200  leu2-3,112Aeu2-3J12  lys2-801Ays2-801 

MAUdMATa  subl-l/cup9A::LEU2  ubrl  A:  :HIS3Aibrl  A: :  HIS 3  trpl  -  A63 Arp  1-A63  ura3-52Aira3-52 
his3-A200/his3-A200  leu2-3,112Aeu2-3J12  lys2-801Ays2-801 


Finley  et  al  (1987) 

Derivative  of  DF5a 
Derivative  of  DF5a 
Derivative  of  DF5a 
Derivative  of  DF5a 
Derivative  of  DF5a 
Derivative  of  DF5a 
Derivative  of  DF5a 
Derivative  of  DF5b 
Derivative  of  DF5b 
Derivative  of  DF5b 
Derivative  of  DF5b 

Derivative  of  DF5b 
Derivative  of  DF5b 
Derivative  of  DF5b 
Derivative  of  DF5b 

Chen  et  al  (1993) 

Chen  et  al  (1993) 

Derivative  of  DF5C 
Sikorski  and  Hieter  (1989) 
Derivative  of  YPH500d 
Derivative  of  YPH500e 
Derivative  of  YPH500f 
Dohmen  et  al  (1995) 

Johnson  et  al  (1995) 

Dohmen  et  al  (1995) 

Madura  and  Varshavsky  (1994) 
Derivative  of  JD51f 
R.Deshaies8 

R. Deshaies® 

S.  Grigory  evh 
Derivative  of  EJ1 
Derivative  of  MHY495 
Derivative  of  MHY507 
Derivative  of  MHY601 
Derivative  of  EJ2 
Derivative  of  YW055 
Derivative  of  MHY599 
Derivative  of  RJD549 
Derivative  of  JD83-1A 
Derivative  of  JD55 
Derivative  of  JD55 
Derivative  of  JD52 
Produced  by  mating  CBY15  and 
CBY16 

Produced  by  mating  CBY15  and 
CBY17 


aJohnson  et  al  (1992,  1995).  A  gift  from  E. Johnson,  the  Rockefeller  University,  New  York,  NY  10021-6399,  USA. 

bChen  et  al  (1993).  A  gift  from  M.Hochstrasser,  Department  of  Biochemistry  and  Molecular  Biology,  University  of  Chicago,  Chicago,  IL  60637, 
USA. 

CA  gift  from  S.Jentsch,  ZMBH,  Universitat  Heidelberg,  69120  Heidelberg,  Germany. 
dDohmen  et  al  (1990). 

eA  gift  from  K.Madura,  Department  of  Biochemistry,  UMDNJ- Johnson  Medical  School,  Piscataway,  NJ  08854,  USA. 
fA  gift  from  J. Dohmen,  Heinrich-Heine-Universitat,  Institut  ftlr  Mikrobiologie,  40225  Diisseldorf,  Germany. 

8A  gift  from  R.Deshaies,  Division  of  Biology,  Caltech,  Pasadena,  CA  91125,  USA. 

hA  gift  from  S. Grigoryev,  Department  of  Biology,  University  of  Massachusetts,  Amherst,  MA  01003,  USA. 


disruption  allele  was  used  to  replace  the  wild-type  CUP9  alleles  of  JD52 
(wild-type)  and  JD55  {ubrl A)  by  homologous  recombination  (Rothstein, 
1991),  generating  strains  CBY19  and  CBY17,  respectively. 

CUP9-expressing  plasmids 

The  plasmid  pCB116,  which  expressed  Cup9p-FLAG  from  the  P GAu 
promoter,  was  constructed  by  subcloning  an  -1  kb  BamBl-EcoKl 
fragment  containing  the  CUP9-FLAG  ORF  into  the  BaniHl-EcoRl  site(s) 
of  p416GALl  (Mumberg  et  al. ,  1994).  The  CUP9-FLAG- containing 
fragment  was  constructed  by  PCR  amplification  of  the  CUP9  ORF  of 


plasmid  pCBlll  using  primers  PCB1  (5'-CGCGGATCCGAATAGT- 
TACATTCGAAGATG-3')  and  PCB6  (5'-CCGGAATTCTCATTTA- 
TC  ATC  ATCGTCTTTGTA  ATC  ATTC  ATATC  AGGGTTGG  ATAG-3 ' ), 
resulting  in  the  addition  of  the  8-residue  FLAG  epitope,  DYKDDDDK, 
to  the  C-terminus  of  Cup9p.  pCBlll  was  constructed  by  subcloning  the 
-2.8  kb  Hindlll-Kpnl  fragment  of  the  C£/P9-containing  pSUBl-1  (see 
above)  into  Hindlll-Kpnl-cuX  pRS316  (Sikorski  and  Hieter,  1989). 
pCB202  was  constructed  by  subcloning  an  -1  kb  fragment  containing 
the  1?cup9  promoter  into  HmdIII-itamHI-cut  pCB201.  [The  -1  kb 
fragment  was  produced  by  PCR  from  pCBlll  using  primers  PCB8 
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(5  '-GTGTTAGTAAGCTTGTAAAGG  AATGCACGTATT-3 ')  and  PCB9 
(5 ' -CCC  GCGG  ATCCGC  ATGC  A  ACTATTCTCG  A  AGGTTGT-3 ' )  d 
pCB200  (ARS-CEN,  LEU2)  and  pCB201  (2p,  LEU2)  were  constructed 
by  replacing  the  517  bp  ScaI-£c<?RI  fragment  of  pBR322  (Ausubel 
et  al,  1992)  with,  respectively,  the  3822  bp  Scal-Nael  fragment  of 
pRS415  (Sikorski  and  Hieter,  1989)  and  the  4650  bp  Scal-Nael  fragment 
of  pRS425  (Christianson  et  al,  1992). 

The  plasmid  pCB209  (2p,  LEU2),  which  expressed  CUP9  from  the 
P cup9  promoter,  was  constructed  by  replacing  the  Sphl-Sall  fragment 
of  pCB202  with  an  -1  kb  fragment  containing  the  CUP9  ORF  that 
was  produced  by  PCR  from  pCBlll,  using  primers  PCB10  (5  -CCC- 
GCGGATCCGCATGCGAAGATGAATTATAACTGC-3')  and  PCB12 
(5'-CCCGCGCGGTCGACCTCAATTCATATCAGGGTTGGATAG-3'). 

pCB210  (ARS-CEN,  LEU2)  that  expressed  Cup9p-FLAG  from  the  P CUP9 
promoter  was  constructed  by  replacing  the  Sphl-SaH.  fragment  of  pCB202 
with  an  -1  kb  fragment  containing  the  CUP9-FLAG  ORF,  which  was 
produced  from  pCBlll  using  primers  PCB10  (see  above)  and  PCB13 
(5'-CCCGCGCGGTCGACCTCATTTATCATCATCGTCTTTGTA- 

ATC ATTC ATATC AGG GTTGGATAG-3 ' ),  yielding  pCB211.  The  -2  kb 
HindlXl-SaU.  fragment  of  pCB211  containing  P cvP9  an^  GUP9- 
FLAG  ORF  was  subcloned  into  pCB200,  yielding  pGB210. 

Plasmid  pCB120,  expressing  GST-Cup9p-ha2  from  the  P gali  promoter, 
was  constructed  by  subcloning  the  -1.6  kb  Xbal-EcoRl  fragment, 
containing  the  GST-CUP9-ha2  ORF,  into  the  Xbal-EcoRl  site(s)  of 
p416GALl.  The  Xbal-EcoRl  fragment  was  produced  by  PCR  amplifica¬ 
tion  of  the  GST-CUP9-ha2  ORF  of  pGEX-2T-CUP9-ha2,  using  the 
primers  PCB3  (5 '  -CCGGA ATT CTC A AGC  GTA AT CT GGAACATC- 

GTATGGGTAAGCGTAATCTGGAACATCGTATGGGTAATTCATA- 

TCAGGGTTGGATAG-3')  and  PCB5  (5'-TGCTCTAGAACAGT- 
ATTC  ATGTCCCCTATA-3 ') .  pGEX-2T-CUP9-ha2  was  constructed  by 
subcloning  an  -1  kb  fragment  containing  the  CUP9-ha2  ORF  into  the 
BamRl-EcoRl  site(s)  of  pGEX-2T  (Pharmacia),  resulting  in  an  in-frame 
fusion  of  the  sequence  encoding  26  kDa  glutathione  .S-transferase  (GST) 
domain  of  Saccharomyces  japonicum  (Smith  and  Johnson,  1988)  to  the 
second  codon  of  CUP9.  The  CUP9-ha2-i containing  fragment  was  pro¬ 
duced  by  PCR  amplification  of  the  CUP9  ORF  of  pCBlll  using  the 
primers  PCB3  (see  above)  and  PCB4  (5'-CGCGGATCCAATTAT~ 
AACTG  CG  AAATAC  A  A  A  AC-3 ' ) .  This  step  added  to  the  C-terminus  of 
Cup9p  a  sequence  encoding  a  tandem  repeat  of  the  9-residue  sequence 
YPYDVPDYA,  derived  from  hemagglutinin  (ha)  of  influenza  virus. 

Northern  hybridization 

RNA  was  isolated  from  S.cerevisiae  as  described  (Schmitt  et  al. ,  1990). 
Electrophoresis  of  the  RNA  samples  was  carried  out  on  a  formaldehyde 
RNA  gel  (Ausubel  et  al,  1992).  An  -50-pg  RNA  sample  was  loaded 
on  a  1%  agarose  gel  containing  IX  MOPS  buffer,  0.74%  (v/v)  formalde¬ 
hyde,  1.9  mg/ml  iodoacetamide  and  0.5  pg/ml  ethidium  bromide. 
Electrophoresis  was  carried  out  in  IX  MOPS  buffer  at  5  V/ cm. 
RNA  was  transferred  to  BrightStar-Plus  membrane  (Ambion)  using 
TurboBlotter  (Schleicher  &  Schuell)  and  Ambion  RNA  transfer  buffer. 
RNA  was  crosslinked  to  the  air-dried  membranes  using  254  nm  light 
(Ausubel  et  al,  1992). 

DNA  probes  were  prepared  by  the  random  priming  method  (Ausubel 
et  al,  1992)  using  [32P]dCTP  and  a  DNA  labeling  kit  (Pharmacia). 
Hybridization  was  carried  out  for  8-16  h  at  42°C  in  Prehybridization/ 
Hybridization  Solution  (Ambion).  Filters  were  washed  according  to  the 
manufacturer’s  protocol  and  subjected  to  autoradiography. 

Gel  shift  assay 

PCR  was  used  to  extend  the  Cup9p  ORF  with  a  sequence  encoding 
Ser-GIy-Gly-Thr-His6,  yielding  Cup9p-H6,  and  to  engineer  flanking 
restriction  sites  (Ndel  and  BamHl)  for  insertion  into  pET-llc  (Novagen). 
Cup9p-Hg  was  overexpressed  in  E.coli  BL21  (DE3)  (Novagen)  (Ausubel 
et  al.,  1992)  and  purified  on  a  3  ml  Ni-NTA  column  (Qiagen),  using  a 
linear  gradient  of  imidazole.  Cup9p-Hg  eluted  at  ~0.25  M  imidazole 
(-90%  pure  at  this  step);  it  was  dialyzed  at  4CC  against  10%  glycerol, 
0.1  M  KC1,  1  mM  EDTA,  0.5  mM  dithiothreitol,  20  mM  HEPES, 
pH  7.9,  and  then  snap-frozen  in  multiple  samples  in  liquid  N2,  and 
stored  at  -80°C.  The  proximal  (-1  to  -447)  and  distal  (-448  to  -897) 
PTR2  promoter  probes  for  the  gel  shift  assay  were  constructed  by  PCR 
amplification  in  the  presence  of  [a-32P]dCTP,  and  were  purified  using 
spin  columns  (Qiagen).  The  gel  shift  reactions  (20  jil)  contained 
50  pg/ml  poly-dl-dC  (Pharmacia);  -1.5  pg/ml  (500  c.p.m.)  DNA  probe; 
1  mg/ml  acetylated  serum  albumin  (New  England  Biolabs)  and  either 
0.1,  0.5  or  1  pg/ml  of  Cup9p-H£  in  10%  glycerol,  0.1  M  KC1,  2.5  mM 
MgCl2,  1  mM  EDTA,  20  mM  HEPES,  pH  7.9.  The  samples  were 
incubated  for  30  min  at  room  temperature,  then  loaded  onto  a  4% 


polyacrylamide  gel  (40:1,  acrylamide :6is-acrylamide)  in  0.5  X  TBE 
(Ausubel  et  al,  1992),  and  electrophoresed  at  10  V/cm  for  3  h  at  4°C, 
followed  by  autoradiography. 

Pulse-chase  analysis  of  Cup9p 

One  hundred  ml  cultures  of  S.cerevisiae  JD52  ( UBR1 )  and  JD55  (ubrlA) 
carrying  either  pCB210  (expressing  Cup9p-FLAG  from  the  CUP 9 
promoter)  or  pCB200  (vector  alone)  were  grown  to  an  A^qo  of  —1  in 
SD(-Leu)  medium.  Cells  (50  A60o  units  total)  from  each  of  the  four  cul¬ 
tures  were  gently  pelleted  by  centrifugation,  washed  with  5  ml  of  SD(- 
Leu),  pelleted  again,  resuspended  in  2  ml  of  SD(-Leu),  and  incubated 
at  30°C  for  10  min.  Each  sample  was  labeled  for  5  min  with  1.4  mCi 
of  [35S]methionine/cysteine  (EXPRESS,  New  England  Nuclear)  at  30°C, 
followed  by  pelleting  in  a  microfuge  for  - 1 5  s .  The  cells  were  resuspended 
in  2.6  ml  of  SD(-Leu),  5  mM  L-methionine,  5  mM  L-cysteine,  and 
incubated  at  30°C.  Samples  of  0.5  ml  were  withdrawn  during  the 
incubation,  pelleted  and  resuspended  in  0.15  ml  of  0.5  M  NaCl-Lysis 
Buffer  (1%  Triton  X-100,  0.5  M  NaCl,  5  mM  EDTA,  50  mM  Na- 
HEPES,  pH  7.5)  containing  a  mixture  of  protease  inhibitors  (Ghislain 
et  al,  1996).  Glass  beads  (0.5  mm)  were  added,  and  cells  were  disrupted 
by  vortexing  (six  times,  for  30  s  each,  with  1  min  incubations  on  ice  in 
between),  followed  by  the  adjustment  of  NaCl  concentration  to  0.15  M 
through  the  addition  of  75  mM  NaCl-Lysis  Buffer,  further  vortexing  for 
30  s,  and  centrifugation  at  12  000  g  for  10  min.  The  volumes  of 
supernatants  were  adjusted  to  equalize  the  amounts  of  10%  trichloroacetic 
acid-insoluble  35S.  Cup9p-FLAG  was  immunoprecipitated  by  the  addition 
of  20  pi  of  the  monoclonal  anti-FLAG  M2  antibody  conjugated  to 
agarose  beads  (Kodak).  Suspensions  were  incubated  at  0°C  for  1  h,  with 
rotation,  then  centrifuged  at  12  000  g  for  30  s,  and  washed  four  times 
with  0.8  ml  of  0.15  M  NaCl-Lysis  Buffer.  The  pellets  were  resuspended 
in  SDS-sample  buffer,  heated  at  100°C  for  3  min,  and  subjected  to  SDS- 
12%  PAGE,  followed  by  autoradiography  and  quantitation  using  a 
Phosphorlmager  (Molecular  Dynamics). 

Pulse-chase  analysis  of  GST-Cup9p-ha2  was  carried  out  as  described 
by  Bartel  et  al  (1990).  Approximately  10  Agoo  units  of  galactose-induced 
cells  were  labeled  for  5  min  with  0.3  mCi  of  [35S]EXPRESS  in  400  pi 
SM-galactose  (-Ura)  at  30°C.  The  cells  were  then  transferred  to 
microfuge  tubes,  pelleted  and  resuspended  in  500  pi  of  SD  (-Ura), 
5  mM  L-methionine,  5  mM  L-cysteine.  Samples  of  0.1  ml  were  withdrawn 
during  the  incubation,  pelleted  and  lysed  as  above.  The  35S-labeled  GST- 
Cup9p-ha2  was  purified  using  glutathione-agarose  beads  (Sigma)  which 
had  been  blocked  with  bovine  serum  albumin  (BSA;  10  mg/ml).  Twenty 
pi  of  glutathione-agarose  beads  were  added  to  each  sample  and  the 
suspensions  were  incubated  at  0°C  for  60  min,  with  rotation,  followed 
by  washes  and  electrophoretic  analyses  as  described  for  Cup9p-FLAG. 

Purification  and  N-terminal  sequencing  of  Cup9p-FLAG 

Four  2-1  cultures  of  JD55  (ubrlA)  carrying  pCB116  that  expressed 
Cup9p-FLAG  from  the  GALI  promoter  were  grown  under  selection  in 
SM-raffinose  to  an  A60o  of  -0.8,  followed  by  transfer  to  SM-galactose 
and  incubation  at  30°C  for  3  h.  Longer  induction  times  resulted  in  a 
Cup9p-mediated  cytotoxicity  and  lower  yields  of  Cup9p-FLAG.  The 
cells  (-1 X  10n)  were  harvested  and  lysed  at  4°C  as  described  by  Burgers 
(1995).  The  extract  was  fractionated  by  precipitation  with  0.4%  PolyminP 
(Sigma)  and  then  further  by  precipitation  with  48%  saturated  ammonium 
sulfate.  The  pellet  was  dissolved  in  3  ml  of  TBS  buffer  (0.15  M  NaCl, 
50  mM  Tris-HCl,  pH  7.5),  and  passed  through  Sephadex  G-25  in  TBS. 
The  resulting  sample  (8.3  ml,  -70  mg/ml  of  protein)  was  applied  to  a 
column  (1  ml)  of  the  monoclonal  anti-FLAG  M2  antibody  (Kodak).  The 
column  was  washed  three  times  with  10  ml  of  TBS,  and  Cup9p-FLAG 
was  eluted  by  the  addition  of  five  1-ml  samples  of  TBS  containing, 
respectively,  50,  100,  100,  200  and  200  pg/ml  of  the  FLAG  peptide 
(Kodak).  Peak  Cup9p-FLAG  fractions  (detected  by  immunoblotting) 
were  concentrated  by  partial  lyophilization,  followed  by  precipitation 
with  methanol  (Wessel  and  Fliigge,  1984)  in  the  presence  of  human 
insulin  (Sigma,  0.3  mg/ml)  as  a  carrier.  The  resulting  sample  was 
fractionated  by  SDS-12%  PAGE  and  electroblotted  onto  Pro-Blot 
membrane  (Perkin-Elmer).  After  a  brief  staining  with  Coomassie,  the 
band  of  the  37  kDa  Cup9p-FLAG  (-15  pmol)  was  excised  from  the 
membrane.  Half  of  the  sample  was  used  to  determine  the  amino  acid 
composition;  the  other  half  was  subjected  to  N-terminal  sequencing  for 
seven  cycles,  using  the  Applied  Biosystems  476A  protein  sequencer  at 
the  Caltech  Microchemistry  Facility. 
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Ubiquitin-dependent  degradation  of  intracellular 
proteins  underlies  a  multitude  of  biological  processes, 
including  the  cell  cycle,  cell  differentiation,  and  re¬ 
sponses  to  stress.  One  ubiquitin-dependent  proteo¬ 
lytic  system  is  the  N-end  rule  pathway,  whose  targets 
include  proteins  that  bear  destabilizing  N-terminal 
residues.  This  pathway,  which  has  been  characterized 
only  in  somatic  cells,  is  shown  here  to  be  present  also 
in  germ  line  cells  such  as  the  eggs  of  the  amphibian 
Xenopus  laevis .  We  demonstrate  that  the  set  of  desta¬ 
bilizing  residues  in  the  N-end  rule  pathway  of  Xenopus 
eggs  is  similar,  if  not  identical,  to  that  of  somatic  cells 
such  as  mammalian  reticulocytes  and  fibroblasts.  It  is 
also  shown  that  the  degradation  of  engineered  N-end 
rule  substrates  in  egg  extracts  can  be  strongly  and 
selectively  inhibited  by  dipeptides  bearing  destabiliz¬ 
ing  N-terminal  residues.  This  result  allowed  us  to  ask 
whether  selective  inhibition  of  the  N-end  rule  path¬ 
way  in  egg  extracts  influences  the  apoptosis-like 
changes  that  are  observed  in  these  extracts.  A  dipep¬ 
tide  bearing  a  bulky  hydrophobic  (type  2)  destabiliz¬ 
ing  N-terminal  residue  was  found  to  delay  the  apopto- 
tic  changes  in  egg  extracts,  whereas  dipeptides  bear¬ 
ing  basic  (type  1)  destabilizing  N-terminal  residues 
had  no  effect.  High  activity  of  the  N-end  rule  pathway 
in  egg  extracts  provides  an  alternative  to  reticulocyte 
extracts  for  the  in  vitro  analyses  of  this  pathway. 

C  1998  Academic  Press 

Key  Words:  ubiquitin;  proteolysis;  N-end  rule;  apop¬ 
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A  number  of  regulatory  circuits,  including  those  that 
control  the  cell  cycle,  cell  differentiation,  and  responses 
to  stress,  involve  metabolically  unstable  proteins  (1— 
4).  A  short  in  vivo  half-life  of  a  regulator  provides  a  way 
to  generate  its  spatial  gradients  and  allows  for  rapid 

1  To  whom  correspondence  should  be  addressed  at  Division  of 
Biology,  147-75,  Caltech,  1200  East  California  Boulevard,  Pasadena, 
CA  91125.  Fax:  (626)  440-9821.  E-mail:  avarsh@cco.caltech.edu. 


adjustments  of  its  concentration,  or  subunit  composi¬ 
tion,  through  changes  in  the  rate  of  its  synthesis  or 
degradation.  Damaged  or  otherwise  abnormal  proteins 
tend  to  be  short-lived  as  well. 

Features  of  proteins  that  confer  metabolic  instability 
are  called  degradation  signals,  or  degrons  (5).  The  es¬ 
sential  component  of  one  degradation  signal,  called  the 
N-degron,  is  a  destabilizing  N-terminal  residue  of  a 
protein  (6,  7).  The  set  of  amino  acid  residues  which  are 
destabilizing  in  a  given  cell  yields  a  rule,  called  the 
N-end  rule,  which  relates  the  in  vivo  half-life  of  a 
protein  to  the  identity  of  its  N-terminal  residue.  Simi¬ 
lar  but  distinct  versions  of  the  N-end  rule  pathway  are 
present  in  all  organisms  examined,  from  mammals  to 
fungi  and  bacteria  (7). 

In  eukaiyotes,  an  N-degron  comprises  at  least  two 
determinants:  a  destabilizing  N-terminal  residue  and  an 
internal  lysine  or  lysines  (8-10).  The  Lys  residue  is  the 
site  of  formation  of  a  multiubiquitin  chain  (11).  The  N- 
end  rule  pathway  is  thus  a  part  of  the  ubiquitin  (Ub)2 
system.  Ub  is  a  76-residue  eukaryotic  protein  whose  co¬ 
valent  conjugation  to  other  proteins  plays  a  role  in  a 
multitude  of  biological  processes  (1,  12-15).  In  most  of 
them,  Ub  acts  through  routes  that  involve  the  degrada¬ 
tion  of  Ub-protein  conjugates  by  the  26S  proteasome,  an 
ATP-dependent  multisubunit  protease  (16-18). 

It  was  found  that  linear  Ub  fusions  are  rapidly 
cleaved  by  Ub-specifie  proteases  (UBPs)  at  the  Ub- 
protein  junction,  making  possible  the  production  of 
otherwise  identical  proteins  bearing  different  N-termi- 
nal  residues.  This  technical  advance  led  to  the  discov¬ 
ery  of  the  N-end  rule  pathway  (6,  7).  The  N-end  rule  is 
organized  hierarchically.  In  the  yeast  Saccharomyces 
cerevisiae ,  Asn  and  Gin  are  tertiary  destabilizing  N- 
terminal  residues  in  that  they  function  through  their 
conversion,  by  enzymatic  deamidation,  into  the  second - 

2  Abbreviations  used:  Ub,  ubiquitin;  DHFR,  mouse  dihydrofolate 
reductase;  R-transferase,  Arg-tRNA-protein  transferase;  CSF,  cyto¬ 
static  factor;  TCA,  trichloroacetic  acid;  DTT,  dithiothreitol;  AMC, 
7-amino-methylcoumarin;  UBP,  Ub-specific  processing  protease. 
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ary  destabilizing  N-terminal  residues  Asp  and  Glu. 
The  destabilizing  activity  of  N-terminal  Asp  and  Glu 
requires  their  conjugation,  by  Arg-tRNA-protein  trans¬ 
ferase  (R-transferase),  to  Arg,  one  of  the  primary  de¬ 
stabilizing  residues  (19-21).  The  primary  destabilizing 
N-terminal  residues  are  bound  directly  by  N-recognin 
(also  called  E3),  the  recognition  component  of  the  N- 
end  rule  pathway.  In  S.  cerevisiae,  N-recognin  is  a 
225-kDa  protein,  encoded  by  UBR1,  that  targets  poten¬ 
tial  N-end  rule  substrates  through  the  binding  to  their 
primary  destabilizing  N-terminal  residues  —  Phe,  Leu, 
Trp,  Tyr,  lie,  Arg,  Lys,  or  His  (7,  22).  N-recognin  has  at 
least  two  substrate-binding  sites.  The  type  1  site  is 
specific  for  the  basic  N-terminal  residues  Arg,  Lys,  and 
His.  The  type  2  site  is  specific  for  the  bulky  hydropho¬ 
bic  N-terminal  residues  Phe,  Leu,  Trp,  Tyr,  and  He  (7). 

The  currently  known  physiological  substrates  of  the 
N-end  rule  pathway  are  the  RNA  polymerase  of  Sindbis 
virus  (23),  the  GR41 -encoded  Ga  subunit  of  the  S.  cerevi¬ 
siae  heterotrimeric  G  protein  (24),  and  Cup9p,  a  short¬ 
lived  transcriptional  repressor  in  S.  cerevisiae  that  con¬ 
trols  the  expression  of  PTR2,  which  encodes  a  peptide 
transporter  (25).  The  Ubrlp-Cup9p-Ptr2p  circuit,  which 
controls  the  import  of  peptides  in  yeast,  is  the  first  clear 
instance  of  a  physiological  function  of  the  N-end  rule 
pathway  (25).  Among  the  cells  of  multicellular  organ¬ 
isms,  this  proteolytic  system  was  characterized  in  rabbit 
reticulocytes  (19)  and  L  cells,  a  line  of  fibroblast-like 
mouse  cells  (26).  Recently,  it  was  suggested  that  the 
N-end  rule  pathway  might  play  a  role  in  apoptosis  (pro¬ 
grammed  cell  death)  (7).  A  way  to  verify  this  conjecture 
would  be  to  use  a  mutant  that  lacks  the  N-end  rule 
pathway.  Such  mutants  were  constructed  and  character¬ 
ized  in  Escherichia  coli  and  S.  cerevisiae  (7)  but  not  yet  in 
multicellular  organisms.  Therefore  we  considered  the  use 
of  N-end  rule  inhibitors. 

Previous  work  has  shown  that  the  addition  of  amino 
acid  derivatives  such  as  dipeptides  that  bear  destabi¬ 
lizing  N-terminal  residues  to  reticulocyte  extract  re¬ 
sults  in  a  strong  and  selective  inhibition  of  the  N-end 
rule  pathway  in  the  extract  (19,  27).  Specifically,  dipep¬ 
tides  bearing  type  1  destabilizing  N-terminal  residues 
inhibited  the  degradation  of  test  N-end  rule  substrates 
bearing  basic  (type  1)  destabilizing  N-terminal  resi¬ 
dues  but  had  no  effect  on  the  degradation  of  substrates 
bearing  type  2  N-terminal  residues.  A  converse  pattern 
was  observed  with  dipeptides  bearing  bulky  hydropho¬ 
bic  (type  2)  destabilizing  N-terminal  residues  (19). 
Dipeptides  added  to  S.  cerevisiae  cultures  have  been 
demonstrated  to  inhibit  the  N-end  rule  pathway  in  vivo 
as  well  (28).  However,  experiments  with  the  N-end  rule 
pathway  of  intact  mammalian  cells  have  shown  dipep¬ 
tides  to  be  ineffective  inhibitors  in  this  setting3. 


3  F.  Levy  and  A.  Varshavsky,  unpublished  data. 


Given  this  latter  constraint,  we  sought  to  explore  the 
physiological  consequences  of  inhibiting  a  metazoan 
N-end  rule  pathway  in  the  inhibitor-accessible  setting 
of  a  cell-free  system.  We  also  wished  to  determine 
whether  the  N-end  rule  pathway  is  present  in  germ 
line  cells  such  as  the  eggs  of  the  amphibian  Xenopus 
laevis,  a  major  experimental  organism  in  studies  of 
embryogenesis  and  cell  cycle  control  (4,  29).  Our  find¬ 
ings  are  described  below. 

MATERIALS  AND  METHODS 

Construction  of  plasmids.  The  plasmids  pT7-UbMeKDHFRhis 
and  pT7-UbReKDHFRhis  expressed  Ub-Met-eK-DHFR-His6  and  Ub- 
Arg-eK-DHFR-His6  fusions  (denoted,  respectively,  as  Ub-Met-DHFR 
and  Ub-Arg-DHFR)  in  E.  coli  from  the  T7  polymerase  promoter  (see 
Results  for  the  definitions  of  eK  and  other  terms).  These  plasmids 
were  constructed  from  pEJJl-M  and  pEJJl-R  (30).  The  Kpnl-Hind- 
III  fragment  of  pEJJl-M  and  pEJJl-R  was  replaced  with  a  synthetic 
oligonucleotide  duplex  (5-CCATCACCATCACCATCACTAAA-3/  and 
5  '-AGCTTTTAGTGATGGTGATGGTGATGGGTAC-3 ' )  that  encoded 
the  His6  tag  and  bore  the  Kpnl  and  Hindlll  overhangs.  The  other 
pT7-UbXeKDHFRhis  plasmids,  which  encoded  otherwise  identi¬ 
cal  Ub-X-DHFR  fusions  bearing  different  X  residues  (Met,  Arg, 
Leu,  Phe,  Cys,  Asp,  or  Asn)  (Fig.  X),  were  constructed  from  pT7- 
UbReKDHFRhis  by  site-directed  mutagenesis,  using  PCR  (31). 

Overexpression ,  labeling,  and  purification  of  Ub -X- e^- DHFR-H is 6 
proteins.  A  pT7-UbXeKDHFRhis  plasmid  was  introduced  into  E. 
coli  BL21  (DE3)  (31).  Protein  expression  was  induced  in  E.  coli  by 
adding  isopropyl-l-thio-/3-D-galactopyranoside  (IPTG),  and  cells 
were  labeled  with  [35Slmethionine/cysteine  (35S-Express,  New  En¬ 
gland  Nuclear,  Boston,  MA),  as  described  (30).  The  labeled  cells  were 
collected  by  centrifugation  and  disrupted  by  sonication,  and  a  35S- 
labeled  Ub-X-eK-DHFR-His6  (Ub-X-DHFR)  test  protein  was  purified 
bv  affinity  chromatography  under  nondenaturing  conditions,  using 
the  Ni-NTA  Spin  Kit  (Qiagen,  Chatsworth,  CA),  and  dialyzed  against 
1  mM  MgOL,  1  mM  dithiothreitol  (DTT),  0.1  M  Tns— HC1  (pH  7.7), 
frozen  rapidly,  and  stored  at  -80°C  in  samples  that  were  to  be 
thawed  only  once.  The  specific  radioactivity  of  [35S] Ub-X-DHFR  pro¬ 
teins  was  5—10  x  105  cpm/pg. 

Degradation  assays  in  Xenopus  egg  extracts.  Cytostatic  factor 
(CSF)-arrested  egg  extracts  were  prepared  from  Xenopus  eggs  as  de¬ 
scribed  (32).  In  most  experiments,  cycloheximide  (0.1  mg/ml)  was  added 
to  the  extracts  to  preclude  reincorporation  of  [^Slmethionine  into  newly 
made  proteins.  In  some  experiments  (see  the  legends  to  figures),  egg 
extracts  were  activated  by  the  addition  of  0.4  mM  CaCl2  1  h  before  the 
assay.  [^S]  Ub-X-DHFR  test  proteins  were  added  to  the  extract  to  the 
final  concentration  of  -25  /xg/ml.  Dipeptides  were  added  together  with 
bestatin  (Sigma,  St.  Louis,  MO)  to  the  final  concentrations  of  10  mM 
and  50  /xg/ml,  respectively.  Bestatin  was  added  to  decrease  the  degra¬ 
dation  of  dipeptides  in  the  extract  (27).  (Control  experiments  (not 
shown)  showed  that  the  addition  of  bestatin  alone  did  not  significantly 
inhibit  the  degradation  of  test  proteins  by  the  N-end  rule  pathway.)  The 
following  dipeptides  and  other  amino  arid  derivatives  were  used:  Arg- 
/3-Ala,  Ala-Lys,  Lys-Ala,  Trp-Ala  (Sigma),  and  Tyr-His  (Bachem  Sci¬ 
ence,  King  of  Prussia,  PA).  Stock  samples  of  dipeptides  were  0.5  M 
solutions  in  10  mM  K-Hepes,  pH  7.5. 

To  follow  the  degradation  of  test  proteins,  reaction  mixtures  (0.1 
ml)  prepared  as  described  above  were  incubated  at  23° C.  Samples 
(2.5  pi)  were  withdrawn  in  duplicate  for  each  time  point.  One  sample 
was  examined  by  SDS-12%  PAGE  and  autoradiography,  using  Phos- 
phorlmager  (Molecular  Dynamics,  Sunnyvale,  CA)  (33).  The  other 
sample  was  used  to  determine  the  relative  amount  (%)  of  35S  soluble 
in  5%  trichloroacetic  acid  (TCA)  (30).  This  parameter  was  calculated 
as  follows: 


THE  N-END  RULE  IN  XENOPUS 


319 


Ub 


DHFR“Hisq 


MQ-RGG|x]HGSGAWlJ_PVSLV[KjR[K]TTLAPNTQTASPRALADSLMQRSGIMVR - KD  GTHHHHHH 

\  \  ^  ^  / 

\ 

\ 

\ 


\ 


\ 


N 


\ 


XH 


DHFR 


FIG.  1.  The  Ub-X-DHFR  test  proteins.  These  fusions,  identical  except  for  the  variable  residue  X  adjacent  to  the  Ub  moiety,  were  expressed 
and  radiolabeled  in  E.  coli,  purified,  and  used  as  described  under  Materials  and  Methods.  X  was  either  Met,  Arg,  Leu,  Phe,  Cys,  Asp,  or  Asn. 
The  E.  coli  Lac  repressor-derived  42-residue  sequence,  denoted  as  eK  (extension  (e)  containing  lysines  (K))  was  described  previously  (8,  30, 
37).  The  alternative  ubiquitylation  sites  Lvs-15  and  Lys-17  are  boxed  in  the  sequence  of  eK  (8).  DHFR  represents  DHFR-His6,  the  mouse 
DHFR  whose  C-terminus  was  extended  by  8  residues  that  included  the  His6  tag. 


X  a 

percent  TCA-soluble  35S  =  y  • j-  *  100%, 

where  X  is  the  amount  of  TCA-soluble  35S  (cpm);  Y  is  the  total 
amount  of  35S  (cpm)  in  the  same  sample;  a  is  the  number  of  Met 
residues  in  a  Ub-X-DHFR  test  protein  (a  =  10  for  Ub-Met-DHFR;  a  = 
9  for  the  other  Ub-X-DHFRs).  The  a  -1  term  above  corrects  for  the 
presence  of  one  Met  residue  in  Ub. 

The  data  presented  in  each  of  the  experimental  Figures  were 
produced  using  the  same  extract  on  the  same  day.  Unless  stated 
otherwise,  the  data  were  consistent  between  at  least  two  (usually 
three  or  more)  independent  experiments  carried  out  with  different 
preparations  of  the  extract. 

In  vitro  apoptosis  assays.  Apoptotic  extracts  from  Xenopus  eggs 
were  prepared  according  to  Newmeyer  et  al.  (34).  The  difference 
between  control  and  apoptotic  egg  extracts  was  in  the  time  intervals 
between  hormone  injections  used  to  produce  the  eggs.  Frogs  were 
injected  with  75  units  of  pregnant  mare  serum  gonadotropin  (Cal- 
biochem,  San  Diego,  CA)  14-30  days  (in  contrast  to  3-10  days  for 
control  extracts)  before  they  were  induced  to  lay  eggs  by  injecting  800 
units  of  human  chorionic  gonadotropin  (Sigma).  Eggs  were  collected, 
and  the  extract  was  prepared  as  described  (32).  The  extracts  were 
used  within  1  h  after  preparation.  For  most  assays,  the  extracts  were 
arrested  in  interphase  by  the  addition  of  0.4  mM  CaCl2  and  0.1 
mg/ml  cycloheximide.  Demembranated Xenopus  sperm  nuclei  (32),  at 
—  1000  per  /il  of  extract,  were  also  added  at  this  time.  Extracts 
treated  as  described  were  distributed  into  0.1 -ml  samples  and  incu¬ 
bated  at  23°C.  Apoptotic  changes  in  the  extracts  were  monitored  in 
two  ways:  by  observing  shrinkage  and  disintegration  of  the  added 
sperm  nuclei  using  phase  contrast  and  fluorescent  microscopy  (34) 
and  by  measuring  the  DEVD-specific  protease  activity.  The  latter 
assay  determined  the  total  activity  of  caspases  that  recognize  the 
tetrapeptide  sequence  DEVD  (Asp-Glu-Val-Asp)  (35,  36). 

The  DEVD-specific  protease  activity  was  measured  by  incubating 
5  pi  of  extract  with  a  quenched  fluorescent  substrate  Ac-DEVD-AMC 
(50  pM)  (Bachem  Bioscience)  in  0.1  ml  of  the  extract  buffer  (15  mM 
MgCl2,  1  mM  DTT,  20  mM  EGTA,  80  mM  Na-j3-gtycerolphosphate, 
pH  7.3).  The  reaction  was  carried  out  for  10  min  at  23°C,  followed  by 
freezing  of  a  20-p.l  sample  in  liquid  N2.  Samples  were  thawed  by 
diluting  into  1  ml  of  phosphate-buffered  saline,  and  their  fluores¬ 
cence  (excitation  at  380  nm  and  emission  at  460  nm)  was  measured 
in  the  Hitachi  F-4500  spectrofluorimeter  (35,  36).  AMC  was  used  as 
a  fluorescence  standard. 

RESULTS 

The  N-End  Rule  Pathway  in  Xenopus  Egg  Extracts 

Among  the  cells  of  multicellular  organisms,  the  N-end 
rule  pathway  has  been  identified  and  analyzed  in  rabbit 


reticulocytes  (19)  and  L  cells,  a  line  of  transformed,  fibro- 
blast-like  mouse  cells  (26).  To  determine  whether  the 
N-end  rule  pathway  is  present  in  germ  line  cells  such  as 
Xenopus  eggs,  we  used  an  approach  similar  to  the  earlier 
one  with  rabbit  reticulocytes  (19,  30). 

A  set  of  N-end  rule  substrates  differing  exclusively 
by  their  N-terminal  residues  was  constructed,  ex¬ 
pressed  in  E.  coli ,  metabolically  labeled  with  [35S]me- 
thionine,  and  purified  by  affinity  chromatography  (see 
Materials  and  Methods).  A  test  protein  of  this  set, 
Ub-X-eK-DHFR-His6,  denoted  below  as  Ub-X-DHFR, 
contained  the  following  parts:  the  N-terminal  Ub  moi¬ 
ety;  a  variable  residue  X;  eK,  a  42-residue,  E .  coli  Lac 
repressor-derived  sequence  that  contained  the  second 
(lysine)  determinant  of  the  N-degron  (8,  37);  and  the 
His6-tagged  mouse  dihydrofolate  reductase  moiety 
(Fig.  1).  Ub  fusions  are  not  cleaved  at  the  Ub-protein 
junction  in  E.  coli ,  which  lacks  the  Ub  system  (38).  By 
contrast,  in  eukaryotes  the  Ub  fusions,  including  the 
Ub-X-DHFRs,  are  rapidly  cleaved  by  UBPs  after  the 
last  residue  of  Ub,  making  it  possible  to  produce,  in 
vivo  or  in  vitro ,  otherwise  identical  test  proteins  such 
as  X-DHFRs  that  differ  exclusively  by  their  N-terminal 
residues  (6,  7). 

CSF-arrested  Xenopus  egg  extracts  were  prepared 
from  unfertilized  frog  eggs  (32).  Specific  35S-labeled 
Ub-X-DHFR  test  proteins  (Fig.  1)  were  added  to  the 
egg  extracts,  and  their  metabolic  fates  were  monitored 
either  by  SDS-PAGE  and  autoradiography  or  by  mea¬ 
suring  the  amount  of  TCA-soluble  35S  released  during 
incubation  at  23°C.  The  identity  of  residue  X  in  the 
tested  X-DHFRs  encompassed  at  least  one  representa¬ 
tive  of  each  class  of  N-terminal  destabilizing  residues: 
Asn  (tertiary  destabilizing);  Asp  and  Cys  (secondary 
destabilizing),  Arg  (type  1  primary  destabilizing),  Phe 
and  Leu  (type  2  primary  destabilizing),  and  Met  (sta¬ 
bilizing)  (see  introduction)  (7). 

As  expected,  a  Ub-X-DHFR  test  protein  was  deubiq- 
uitylated  upon  its  addition  to  the  extract.  This  was 
detected  through  an  increase  in  the  electrophoretic 
mobility  of  the  major  35S-labeled  species  and  the  con- 
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FIG.  2.  Deubiquitylation  and  degradation  of  Ub-X-DHFR  test  proteins  in  Xenopus  egg  extract.  (A)  S-labeled,  punfi ed 3 
added  to  a  CSF-arrested  Xenopus  egg  extract  and  incubated  at  23°C  for  the  indicated  times.  The  ^ 

the  labeled  species  were  detected  by  autoradiography  (see  Materials  and  Methods).  (B)  Same  as  A  ^u*wlt^RP^'DFnRe  {  \  b  t 
but  with  Ub-Met-DHFR  (only  the  area  around  Met-DHFR  is  shown).  (D)  Same  as  C,  but  with  Ub-Asp-DHFR.  (E)  Same  as  C,  but  with 
Ub-Asn-DHFR.  Open  arrowheads  indicate  the  bands  of  the  initial,  34-kDa  Ub-X-DHFR.  Closed  arrowheads  indicate  the  bands  of  deubiq- 
uitylated,  26-kDa  X-DHFR.  An  asterisk  indicates  the  band  of  free  [35S]Ub,  which  migrated  close  to  the  dye  front  in  this  electropho 
system.  The  molecular  masses  (in  kDa)  of  protein  markers  are  indicated  to  the  right  of  B. 


comitant  appearance  of  a  labeled  ~8  kDa  species  (Fig. 
2  and  data  not  shown).  Deubiquitylation  of  Ub-X- 
DHFR  in  the  extract  was  essentially  complete  in  5  min 
or  less  (Fig.  2).  Whereas  the  relative  intensity  of  the 
free  Ub  band  remained  nearly  constant  during  the 
subsequent  4-h  incubation  (Figs.  2A  and  2B,  and  data 
not  shown),  the  relative  intensity  of  the  X-DHFR  band 
was  either  constant  or  decreased,  at  a  varying  rate, 
depending  on  the  identity  of  the  N-terminal  residue  X. 
Based  on  their  relative  metabolic  stabilities  in  the  ex¬ 
tract  (Figs.  2  and  3A,  and  data  not  shown),  the  X- 
DHFR  test  proteins  could  be  reproducibly  ranked  from 
the  long-lived  to  the  short-lived  as  follows:  Met  > 
Cys  >  Leu  >  Asp  >  Asn  >  Phe  >  Arg.  We  conclude 
that  Xenopus  egg  extracts  contain  an  N-end  rule  path¬ 
way  whose  rule  book  is  similar,  and  may  be  identical, 
to  those  found  in  rabbit  reticulocytes  (19)  and  mouse  L 
cells  (26). 

The  decay  curves  of  X-DHFRs  that  were  determined 
by  monitoring  the  release  of  TCA-soluble  35S  and  the 
resulting  ranking  of  their  N-terminal  destabilizing  res¬ 
idues  were  reproducible  between  experiments  that  uti¬ 
lized  independent  preparations  of  the  egg  extract  (data 
not  shown).  Specifically,  it  took  —10  min  to  destroy 
50%  of  Arg-DHFR  to  acid-soluble  fragments.  By  30 
min,  the  degradation  of  Arg-DHFR  was  —90%  com¬ 
plete  (Figs.  2A  and  3A). 

Approximately  20%  of  the  added  Met-DHFR,  which 
bore  a  stabilizing  N-terminal  residue  (7),  was  degraded 
during  the  first  10  min  of  incubation,  and  another  5  to 


15%  was  degraded  over  the  next  30  min.  However,  the 
rest  of  the  Met-DHFR  remained  stable  in  the  extract 
(Figs.  2C  and  3A).  A  likely  explanation  of  this  result  is 
that  our  preparation  contained  two  types  of  Met-DHFR 
molecules:  -70%  were  undamaged  and  long-lived  in 
the  extract,  as  observed  with  Met-DHFR  in  vivo,  in 
both  S.  cerevisiae  and  mouse  L  cells  (Ref.  8;  F .  Levy  and 
A.  V.,  unpubl.  data),  whereas  the  remainder  comprised 
misfolded  or  otherwise  damaged  Met-DHFR  molecules 
that  were  degraded  by  a  proteolytic  system  distinct 
from  the  N-end  rule  pathway.  By  inference,  the  same 
background  of  degradation  of  a  subpopulation  of  mol¬ 
ecules  would  be  expected  for  other  X-DHFRs.  It  is 
unclear  whether  a  subpopulation  of  misfolded  or  oth¬ 
erwise  damaged  X-DHFRs  resulted  from  their  overex¬ 
pression  in  E.  coli,  or  whether  the  damage  occurred 
largely  during  purification  of  X-DHFRs.  Independent 
evidence  for  this  explanation  of  the  background  degra¬ 
dation  of  Met-DHFR  is  presented  below. 

Previous  work  has  shown  that  N-terminal  Cys  is  a 
stabilizing  residue  in  S.  cerevisiae  (8),  but  is  a  second¬ 
ary  destabilizing  residue  in  rabbit  reticulocyte  extracts 
and  mouse  L  cells  (19,  26).  In  Xenopus  egg  extracts, 
Cys  was  found  to  be  a  weakly  destabilizing  residue. 
Specifically,  among  the  tested  X-DHFRs,  Cys-DHFR 
was  the  longest-lived  test  protein  save  for  Met-DHFR, 
which  bore  a  stabilizing  N-terminal  residue  (Fig.  3A). 
Substrates  that  bear  secondary  or  tertiary  destabiliz¬ 
ing  residues  require  the  tRNA-dependent  activity  of 
R-transferase  for  their  degradation  by  the  N-end  rule 
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FIG.  3.  Degradation  of  X-DHFR  test  proteins  in  egg  extracts  under  different  conditions,  as  determined  by  measuring  TCA-soluble  35S.  (A) 
35S-labeled,  purified  Ub-Arg-DHFR  (G),  Ub-Phe-DHFR  (O),  Ub-Met-DHFR  (A),  Ub-Asp-DHFR  (O),  and  Ub-Asn-DHFR  (EB)  were  added  to 
samples  of  the  CSF-arrested  Xenopus  egg  extract.  After  incubation  at  23°C  for  the  indicated  times,  the  amounts  of  35S  soluble  in  5%  TCA  were 
determined  for  each  time  point  (see  Materials  and  Methods).  Since  the  free  [35S]Ub,  produced  through  deubiquitylation  of  a  Ub-X-DHFR,  was 
stable  during  the  incubation,  100%  of  TCA-soluble  35S  represented  complete  degradation  of  the  Ub-lacking  [35S]X-DHFR  (see  Materials  and 
Methods  for  further  details).  (B)  Pretreatment  of  egg  extract  with  RNAase  selectively  inhibits  degradation  of  N-end  rule  substrates  bearing 
secondary  destabilizing  N-terminal  residues.  A  CSF-arrested  extract  was  pretreated  with  5  units/ml  of  RNAase  A- agarose  (Sigma)  for  1  h  at 
23°C,  followed  by  the  removal  of  RNAas  e-agarose  by  centrifugation  ( closed  symbol  curves).  Another  sample  of  the  same  extract  was  incubated 
for  1  h  without  RNAase  ( open  symbol  curves).  35S-labeled  Ub-Cys-DHFR  {triangles),  Ub-Asp-DHFR  (circles),  or  Ub-Arg-DHFR  ( squares )  were 
added  to  both  extracts,  and  the  relative  amounts  of  TCA-soluble  35S  released  during  the  incubation  were  determined  as  in  A.  (C)  Different 
states  of  egg  extract  do  not  significantly  alter  the  activity  of  the  N-end  rule  pathway.  35S-labeled  Ub-Phe-DHFR  was  added  to  the 
CSF-arrested  egg  extract  (□),  to  the  CaCl2-treated  extract  that  proceeded  toward  mitosis  (A),  and  to  the  CaCl2-  and  cyclohexiniide-treated, 
interphase- arrested  extract  (O).  The  degradation  of  Phe-DHFR  was  monitored  as  described  in  A. 


pathway  (see  introduction).  Previous  work  has  shown 
that  pretreatment  of  reticulocyte  extract  with  RNAase 
A  does  not  significantly  affect  the  degradation  of  N-end 
rule  substrates  bearing  primary  destabilizing  residues 
but  abolishes  the  degradation  of  otherwise  identical 
substrates  bearing  secondary  or  tertiary  destabilizing 
N-terminal  residues  (19).  In  agreement  with  these 
findings,  preincubation  of  Xenopus  egg  extract  with 
RNAase  A  selectively  inhibited  the  degradation  of  Cys- 
DHFR  and  Asp-DHFR,  but  had  no  effect  on  the  degra¬ 
dation  of  Arg-DHFR  (Fig.  3B).  We  conclude  that  both 
Cys  and  Asp  are  secondary  destabilizing  residues  in 
the  N-end  rule  pathway  of  Xenopus  eggs.  Taken  to¬ 
gether  with  the  earlier  findings  about  the  N-end  rule 
pathway  in  reticulocyte  extract  (19),  the  present  find¬ 
ings  are  consistent  with  the  conjecture  that  the  set  of 
secondary  and  tertiary  destabilizing  residues  in  the 
N-end  rule  of  Xenopus  eggs  is  identical  to  that  of  rabbit 
reticulocytes:  two  tertiary  destabilizing  residues  (Asn 
and  Gin)  and  three  secondary  ones  (Asp,  Glu,  and  Cys). 

We  also  asked  whether  the  rate  of  degradation  of  an 
N-end  rule  substrate  depends  on  physiologically  rele¬ 
vant  changes  in  the  state  of  an  egg  extract.  Unfertil¬ 
ized  Xenopus  eggs  are  arrested  at  the  metaphase  of 
meiosis  II  through  the  action  of  cytostatic  factor  (39, 
40).  Fertilization  results  in  a  transient  increase  of  the 
Ca2+  concentration  in  the  eggs  that  inactivates  CSF, 
allowing  cells  to  enter  interphase  and  proceed  to  mito¬ 
sis.  This  process  can  be  mimicked  in  vitro  by  the  addi¬ 
tion  of  Ca2+  (32).  Such  extracts,  referred  to  as  cycling 
extracts,  undergo  changes  characteristic  of  eggs  at  mi¬ 


tosis,  as  observed  through  alterations  in  the  morphol¬ 
ogy  of  Xenopus  sperm  nuclei  added  to  the  extract.  The 
addition  of  cycloheximide  and  CaCl2  to  a  CSF-arrested 
extract  results  not  only  in  the  escape  from  CSF  arrest 
but  also  in  a  subsequent  arrest  at  interphase,  owing  to 
the  inhibition  of  protein  synthesis,  specifically  the  syn¬ 
thesis  of  cyclins  (32).  We  examined  the  degradation  of 
Phe-DHFR,  bearing  a  type  2  primary  destabilizing  N- 
terminal  residue,  in  the  CSF-arrested,  interphase-ar¬ 
rested,  and  cycling  egg  extracts,  and  found  no  signifi¬ 
cant  differences  in  the  rate  of  degradation  of  this  N-end 
rule  substrate  in  different  extracts  (Fig.  30. 

Dipeptides  Bearing  Destabilizing  N-Terminal 

Residues  Are  Efficacious  Inhibitors  of  the  N-End 

Rule  Pathway  in  Xenopus  Egg  Extracts 

Previous  work  has  shown  that  the  addition  of  amino 
acid  derivatives  such  as  dipeptides  bearing  destabiliz¬ 
ing  N-terminal  residues  to  reticulocyte  extracts  or  in¬ 
tact  yeast  cells  results  in  a  strong  and  selective  inhibi¬ 
tion  of  the  N-end  rule  pathway  in  these  settings  (19, 
27).  However,  experiments  with  intact  mammalian 
cells  in  culture  have  shown  dipeptides  to  be  at  most 
weak  inhibitors  of  the  N-end  rule  pathway  in  this 
setting3.  The  present  work  stemmed  in  part  from  the 
possibility  of  exploring  the  physiological  consequences 
of  perturbing  a  metazoan  N-end  rule  pathway  in  the 
inhibitor-accessible  setting  of  a  cell-free  system  such  as 
a  Xenopus  egg  extract. 
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FIG.  4.  Inhibition  of  the  N-end  rule  pathway  in  egg  extracts  by 
dipeptides  bearing  destabilizing  N-terminal  residues.  (A)  35S-la- 
beled,  purified  Ub-Arg-DHFR  was  added  to  CSF-arrested  Xenopus 
egg  extracts  in  the  absence  (O  or  the  presence  of  the  following 
dipeptides  110  mM),  together  with  bestatin  (50  #ig/ml):  Arg-0-Ala  (O), 
Lys-Ala  (O),  Tyr-His  (A),  Trp-Ala  (SB),  and  Ala-Lys  (X).  After  incu¬ 
bation  at  23°C  for  the  indicated  times,  the  amounts  of  35S  soluble  in 
5%  TCA  were  determined  for  each  time  point.  (B)  Same  as  in  A  but 
with  Ub-Phe-DHFR. 


As  shown  in  Figs.  4  and  5,  Lys-Ala  and  especially 
Arg-/3-Ala,  both  of  which  bear  a  type  1  primary  desta¬ 
bilizing  N-terminal  residue,  strongly  inhibited  the  deg¬ 
radation  of  Arg-DHFR  in  the  egg  extract.  This  inhibi¬ 
tion  was  selective,  in  that  the  same  dipeptides  had  no 
significant  effect  on  the  degradation  of  Phe-DHFR, 
which  bore  a  type  2  primary  destabilizing  N-terminal 
residue.  Conversely,  Tyr-His  and  Trp-Ala,  which  bear 
type  2  destabilizing  N-terminal  residues,  strongly  in¬ 
hibited  the  degradation  of  Phe-DHFR,  but  had  at  most 
a  small  effect  on  the  degradation  of  Arg-DHFR  (Figs.  4 
and  5).  Ala-Lys,  which  bears  a  type  3  destabilizing 
N-terminal  residue  (19),  did  not  inhibit  the  degrada¬ 
tion  of  either  Arg-  or  Phe-DHFRs  (Fig.  4). 

The  dipeptides  had  little  effect  on  the  degradation  of 
their  cognate  N-end  rule  substrates  during  the  first  15 
min  of  the  incubation,  but  subsequent  proteolysis  was 
strongly  inhibited  (Fig.  4).  This  result  is  consistent 
with  the  finding  that  20-30%  of  Met-DHFR,  which 
bears  a  stabilizing  N-terminal  residue,  was  rapidly 
degraded,  whereas  the  rest  of  the  added  Met-DHFR 
was  stable  (Fig.  3A).  As  described  above,  the  latter 
finding  could  be  accounted  for  if  our  preparation  of 
Met-DHFR  (and,  by  inference,  of  other  X-DHFRs)  com¬ 
prised  two  distinct  populations  of  molecules:  undam¬ 
aged  ones  (—70%)  and  hence,  in  the  case  of  Met-DHFR, 
long-lived  in  the  extract,  and  misfolded  or  otherwise 
damaged  molecules  that  were  recognized  and  degraded 
by  a  proteolytic  system  distinct  from  the  N-end  rule 
pathway.  The  failure  of  dipeptides  to  inhibit  the  initial 
burst  of  degradation  of  their  cognate  X-DHFRs  (Fig.  4) 
is  in  agreement  with  this  explanation. 


Apoptosis  and  the  N-End  Rule  Pathway 

Several  groups  have  described  the  use  of  Xenopus 
egg  extracts  to  analyze  the  process  of  apoptosis  (34,  36, 
41_43).  After  several  hours  of  incubation  of  the  inter¬ 
phase-arrested,  apoptotic  egg  extracts,  the  activity  of 
DEVD-specific  caspases  (a  subgroup  in  the  ICE/CED-3 
family  of  cysteine  proteases)  (43)  rises  sharply,  fol¬ 
lowed  by  fragmentation  of  the  added  sperm  nuclei  (34). 
Both  the  induction  of  caspases  and  the  fragmentation 
of  the  added  nuclei  in  the  egg  extract  can  be  suppressed 
by  the  addition  of  Bcl2,  a  protein  that  inhibits  apopto¬ 
sis  in  vivo  (34,  36).  Recent  work  has  shown  that  apo¬ 
ptotic  changes  in  egg  extract  are  triggered,  through 
unknown  cytosolic  factors,  by  the  release  of  cytochrome 
c  from  the  mitochondrial  fraction  of  the  extract  (43). 
Apparently  Bcl2  inhibits  apoptosis  by  blocking  the  re¬ 
lease  of  cytochrome  c  from  mitochondria  (43,  44). 

We  wished  to  determine  whether  the  apoptosis-like 
events  in  egg  extracts  would  be  perturbed  by  dipeptide 
inhibitors  of  the  N-end  rule  pathway.  In  preliminary 
experiments,  we  found  that  the  times  of  apoptotic 
changes  in  egg  extracts  varied  considerably  between 
independent  extract  preparations.  Specifically,  the 
sharp  rise  of  the  DEVD-specific  protease  activity  and 
the  disintegration  of  the  added  sperm  nuclei  could  oc¬ 
cur  as  early  as  2  h  after  the  start  of  incubation  or  as 
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Arg-p-Ala  Arg-p-Ala 


Trp-Ala  Trp-Ala 


FIG.  5.  Electrophoretic  analysis  of  N-end  rule  substrates  in  egg 
extracts  in  the  presence  of  dipeptides.  (A)  35S-labeled,  purified  Ub- 
Arg-DHFR  was  added  to  a  CSF-arrested  Xenopus  egg  extract  in  the 
absence  or  the  presence  of  dipeptides  Arg-/3-Ala  or  Trp-Ala  (at  10 
mM,  together  with  bestatin  at  50  ^g/ml).  After  incubation  at  23°C  for 
the  indicated  times,  the  samples  were  analyzed  by  SDS-PAGE,  and 
the  labeled  proteins  were  detected  by  autoradiography.  Open  arrow¬ 
heads  indicate  the  bands  of  the  initial,  34-kDa  Ub-Arg-DHFR. 
Closed  arrowheads  indicate  the  bands  of  deubiquitylated,  26-kDa 
Arg-DHFR.  (B)  Same  as  in  A  but  with  Ub-Phe-DHFR. 
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FIG.  6.  Apoptotic  changes  in  Xenopus  egg  extracts,  assayed  by 
measuring  the  DEVD-specific  protease  activity.  (A)  A  preparation  of 
the  CSF-arrested  egg  extract  was  divided  into  three  samples.  The 
extracts  were  left  untreated  (□)  or  were  activated  by  either  0.4  mM 
CaCl2  (O)  or  0.4  mM  CaCl2  in  the  presence  of  0.1  mg/ml  cyclohexi- 
mide  (O).  The  samples  were  incubated  at  room  temperature  for  up  to 
5  h.  The  DEVD-specific  protease  activity  in  the  extracts  was  deter¬ 
mined  using  the  fluorogenic  substrate  Ac-DEVD-AMC.  (B)  Samples 
of  interphase-arrested  egg  extract  were  incubated  either  in  the  ab¬ 
sence  (O)  or  in  the  presence  of  the  following  reagents:  Lys-Ala  at  20 
mM  (E);  Ala-Lys  at  20  mM  (O);  Trp-Ala  at  20  mM  (X);  Trp-Ala  plus 
Lys-Ala,  each  at  10  mM  W»;  Ac-DEVD-CHO  (an  inhibitor  of  DEVD 
proteases,  at  10  mM)  (A).  The  dipeptides  were  added  together  with 
bestatin  (50  /ag/ml).  The  DEVD-specific  protease  activity  was  mea¬ 
sured  as  in  A. 


late  as  5  h  (data  not  shown).  This  variability,  reported 
by  others  as  well  (43),  is  likely  to  result  in  part  from  the 
variability  of  the  timing  of  upstream  events  that  trig¬ 
ger  the  beginning  of  apoptosis  in  the  Xenopus  egg  ex¬ 
tract.  In  addition,  the  timing  of  apoptotic  changes  de¬ 
pended  on  the  state  of  the  extracts:  when  a  single 
preparation  was  split  into  three  samples,  and  was  used 
as  CSF-arrested,  cycling,  and  interphase-arrested  ex¬ 
tracts,  the  increase  of  the  DEVD-specific  protease  ac¬ 
tivity  was  observed  first  in  the  interphase-arrested 
extract,  then  in  the  CSF-arrested  one,  and  later  in  the 
cycling  extract  (Fig.  6A).  This  pattern  was  reproducible 
between  different  preparations  of  the  extract,  even 
though  the  absolute  timing  of  the  rise  of  the  DEVD- 
specific  protease  activity  varied  significantly  between 
preparations  (data  not  shown). 

We  asked  whether  selective  inhibition  of  the  N-end 
rule  pathway  would  affect  the  nature  or  kinetics  of  the 
apoptotic  changes.  Dipeptides  that  had  been  shown  to 
be  efficacious  inhibitors  of  the  N-end  rule  pathway 
(Figs.  2  and  3A)  were  added  to  the  interphase-arrested 
and  cycling  extracts  at  the  beginning  of  incubation.  We 
found  that  Lys-Ala  and  Arg-/3-Ala,  containing  type  1 
primary  destabilizing  N-terminal  residues,  and  Ala- 
Lys,  containing  a  type  3  destabilizing  residue,  did  not 
alter  the  timing  of  DEVD-specific  protease  induction 
(Fig.  6B).  In  contrast,  Trp-Ala,  which  bears  a  type  2 
destabilizing  N-terminal  residue,  delayed  both  the  rise 
of  the  DEVD-specific  protease  activity  and  the  disinte¬ 
gration  of  sperm  nuclei  by  ~1  h  in  8  out  of  12  indepen¬ 


dent  experiments  (Fig.  6B  and  data  not  shown).  The 
~1  h  delay  of  apoptotic  changes  by  Trp-Ala  was  signif¬ 
icant  but  weak  in  comparison  to  the  effect  of  Ac-DEVD- 
CHO,  an  inhibitor  of  DEVD-specific  proteases,  which 
delayed  apoptosis  by  at  least  2  h  (data  not  shown). 
Interestingly,  the  proteasome  inhibitor  5-iodo-4-hy- 

droxyl-3-nitrophenylacetyl-leucinyl-leucinyl-leucine- 

vinyl  sulfone  (NIP-L3VS)  (45),  which  inhibited  the  deg¬ 
radation  of  Phe-DHFR  in  egg  extracts,  also  delayed 
apoptosis  in  the  CSF-arrested  extract  by  ~1  h  (data 
not  shown),  similarly  to  the  effect  of  Trp-Ala  (Fig.  6B). 

DISCUSSION 

We  report  the  following  results: 

(1)  A  whole-cell  extract  ofX.  laevis  eggs  contains  the 
N-end  rule  pathway,  as  determined  by  monitoring  the 
metabolic  fates  of  purified,  35S-labeled  Ub-X-DHFR 
test  proteins  added  to  the  extract.  Among  the  cells  of 
multicellular  organisms,  the  N-end  rule  pathway  has 
been  identified  and  characterized  in  rabbit  reticulo¬ 
cytes  (18)  and  L  cells,  a  mouse  cell  line  (19,  26),  but  not, 
until  now,  in  germ  line  cells. 

(2)  The  partial  N-end  rule  defined  in  egg  extracts 
was  found  to  be  identical  to  the  corresponding  subsets 
of  the  N-end  rules  in  rabbit  reticulocytes  and  mouse 
fibroblasts.  In  particular,  we  showed  that  in  egg  ex¬ 
tracts  the  destabilizing  activity  of  N-terminal  Asn, 
Asp,  and  Cys  requires  the  presence  of  RNA  (presum¬ 
ably  tRNA),  strongly  suggesting  that  these  are  second¬ 
ary  (Asp  and  Cys)  and  tertiary  (Asn)  residues  in  the 
egg’s  N-end  rule  pathway,  as  they  have  been  shown  to 
be  in  other  metazoan  cells  (see  introduction  for  the 
terminology)  (7). 

(3)  The  activity  of  the  N-end  rule  pathway  did  not 
change  significantly  upon  shifts  of  the  CSF-arrested  egg 
extract  to  the  cycling  or  the  interphase-arrested  state. 

(4)  Dipeptides  bearing  destabilizing  N-terminal  res¬ 
idues  selectively  and  efficiently  inhibited  the  degrada¬ 
tion  of  N-end  rule  substrates  in  egg  extract.  In  partic¬ 
ular,  dipeptides  bearing  the  basic  (type  1)  primary 
destabilizing  N-terminal  residues  inhibited  the  degra¬ 
dation  of  N-end  rule  substrates  bearing  type  1  but  not 
type  2  N-terminal  residues.  A  converse  inhibition  pat¬ 
tern  was  observed  with  dipeptides  bearing  bulky  hy¬ 
drophobic  (type  2)  N-terminal  residues. 

(5)  The  onset  of  apoptosis-like  changes  in  egg  extract 
(fragmentation  of  the  added  sperm  nuclei  and  abrupt 
increase  of  the  DEVD-specific  protease  activity  (34, 
43))  was  found  to  be  delayed  by  ~1  h  in  the  presence  of 
dipeptide  Trp-Ala,  which  bears  a  type  2  destabilizing 
N-terminal  residue,  but  not  in  the  presence  of  dipep¬ 
tides  bearing  type  1  destabilizing  N-terminal  residues. 

The  N-end  rule  pathway  has  been  identified  in  all 
organisms  examined,  from  mammals  to  fungi  and  bac- 
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teria,  but  the  understanding  of  its  functions  remains 
incomplete  (6,  7).  The  first  clear  example  of  a  physio¬ 
logical  function  of  the  N-end  rule  pathway  is  its  re¬ 
cently  identified  role  in  controlling  the  import  of  pep¬ 
tides  in  S.  cerevisiae  through  the  degradation  of  the 
transcriptional  repressor  Cup9p  that  down-regulates  a 
peptide  transporter  (25,  46).  It  has  also  been  reported 
that  dipeptides  bearing  destabilizing  N-terminal  resi¬ 
dues  specifically  inhibit  differentiation  of  rat  pheochro- 
mocytoma  PC12  cells  (47),  the  neurite  outgrowth  in 
amphibian  neuroepithelial  cells  (48),  and  limb  regen¬ 
eration  in  the  newt  (49),  suggesting  a  role  for  the  N-end 
rule  pathway  in  these  biological  processes.  Unfortu¬ 
nately,  none  of  these  studies  (47-49)  included  control 
experiments  to  verify  that  the  added  dipeptides  actu¬ 
ally  inhibited  degradation  of  a  reporter  N-end  rule 
substrate  in  the  target  cells.  Thus,  it  remains  to  be 
determined  whether  the  observed  biological  effects  of 
dipeptides  (47-49)  were  caused  by  specific  inhibition  of 
the  N-end  rule  pathway  in  the  target  cells  or  resulted 
from  other  effects  of  the  added  dipeptides. 

Apart  from  the  interest  in  determining  whether  the 
N-end  rule  pathway  is  present  in  germ  line  cells  such 
as  Xenopus  eggs,  one  reason  for  initiating  this  work 
was  the  opportunity  to  test,  through  selective  inhibi¬ 
tion  of  the  N-end  rule  pathway  in  egg  extracts,  whether 
this  pathway  plays  a  role  in  apoptosis.  Several  lines  of 
evidence  suggest  a  role  for  the  Ub/proteasome  system, 
of  which  the  N-end  rule  pathway  is  a  part,  in  regulat¬ 
ing  apoptosis  (50-55).  In  particular,  cell-penetrating 
proteasome  inhibitors  were  reported  to  partially  pro¬ 
tect  nonproliferating  cells  such  as  thymocytes  and 
growth  factor-deprived  sympathetic  neurons  from  ap¬ 
optosis  (52,  53).  In  contrast,  the  same  proteasome  in¬ 
hibitors  were  reported  to  induce  apoptosis  in  mitoti- 
cally  active  cells  (51,  54).  An  apoptosis-like  process, 
induced  by  the  stress  of  starvation  and  mediated  by 
proteolysis,  was  also  identified  in  E.  coli  (56).  A  key 
regulator  of  E.  coli  apoptosis  is  a  short-lived  protein 
MazE,  which  is  degraded  by  ClpAP,  a  proteasome-like 
ATP-dependent  protease  that  targets  N-end  rule  sub¬ 
strates  in  E.  coli  (38).  These  and  other  considerations, 
including  the  possibility  that  cleavages  by  caspases  can 
produce  N-end  rule  substrates,  led  to  the  suggestion 
that  the  N-end  rule  pathway  may  function  in  the  con¬ 
trol  of  apoptosis  (7). 

Specific  dipeptides,  which  have  been  shown  to  act  as 
selective  inhibitors  of  the  N-end  rule  pathway  in  re¬ 
ticulocyte  extracts  (19,  27)  and  intact  S.  cerevisiae  cells 
(28),  were  found  to  be  largely  ineffective  with  several 
lines  of  mammalian  cells3.  Xenopus  egg  extracts,  in 
contrast  to  reticulocyte  extracts,  have  been  shown  to 
undergo  apoptosis-like  changes  (34,  36,  43),  hence  the 
choice  of  these  extracts  for  the  present  study.  We  found 
that  dipeptides  bearing  either  type  1  (Arg,  Lys)  or  type 
3  (Ala)  destabilizing  N-terminal  residues  had  no  effect 


on  the  apoptotic  changes  in  the  extracts.  In  contrast, 
Trp-Ala,  which  bears  a  type  2  destabilizing  N-terminal 
residue,  inhibited  apoptosis  in  a  number  of  indepen¬ 
dent  experiments  with  different  preparations  of  egg 
extracts;  in  a  minority  of  tests  the  effect  was  not  ob¬ 
served.  Tyr-His,  a  dipeptide  bearing  another  type  2 
destabilizing  N-terminal  residue,  strongly  inhibited 
the  degradation  of  an  N-end  rule  substrate  such  as 
Phe-DHFR  in  the  extract,  but  was  significantly  less 
efficacious  than  Trp-Ala  in  its  effect  on  the  timing  of 
apoptotic  changes  in  the  same  extract.  Taken  together, 
the  results  of  experiments  with  dipeptides  as  inhibitors 
of  apoptosis  indicate  that  some  inhibitors  of  the  N-end 
rule  pathway,  notably  Trp-Ala,  can  cause  a  significant 
delay  of  apoptosis  in  the  extract.  A  definitive  test  and 
fiirt, her  analysis  of  the  thus  suggested  function  of  the 
N-end  rule  pathway  in  apoptosis  will  require  mutants 
that  eliminate  this  pathway  in  a  multicellular  organ¬ 
ism  without  perturbing  the  rest  of  the  Ub  system. 
Construction  of  a  mouse  mutant  that  lacks  the  entire 
N-end  rule  pathway  (through  a  deletion  of  the  Ubrl 
gene  that  encodes  N-recognin,  the  main  recognition 
component  of  this  proteolytic  system)  is  under  way4. 

In  summary,  we  established  the  existence  of  the 
N-end  rule  pathway  in  germ  line  cells  such  as  Xenopus 
eggs,  showed  that  dipeptides  are  efficacious  inhibitors 
of  the  N-end  rule  pathway  in  egg  extracts,  and  exam¬ 
ined  the  effects  of  dipeptides  on  apoptotic  changes  in 
these  extracts. 
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Summary 

We  report  the  discovery  of  a  short-lived  chaperone 
that  is  required  for  the  correct  maturation  of  the  eukar¬ 
yotic  20S  proteasome  and  is  destroyed  at  a  specific 
stage  of  the  assembly  process.  The  S.  cerevisiae  Umpl  p 
protein  is  a  component  of  proteasome  precursor  com¬ 
plexes  containing  unprocessed  p  subunits  but  is  not 
detected  in  the  mature  20S  proteasome.  Upon  the  as¬ 
sociation  of  two  precursor  complexes,  Umplp  is  en¬ 
cased  and  is  rapidly  degraded  after  the  proteolytic 
sites  in  the  interior  of  the  nascent  proteasome  are 
activated.  Cells  lacking  Umpl  p  exhibit  a  lack  of  coordi¬ 
nation  between  the  processing  of  p  subunits  and  pro¬ 
teasome  assembly,  resulting  in  functionally  impaired 
proteasomes.  We  also  show  that  the  propeptide  of 
the  Pre2p/Doa3p  p  subunit  is  required  for  Umpl  p’s 
function  in  proteasome  maturation. 

Introduction 

Ubiquitin  (Ub)-dependent  proteolysis  underlies  the  bulk 
of  nonlysosomal  protein  degradation  in  eukaryotic  cells 
(reviewed  by  Hochstrasser,  1 996;  Varshavsky,  1 997).  Natu¬ 
rally  short-lived  as  well  as  damaged  or  otherwise  abnor¬ 
mal  proteins  are  recognized  by  the  Ub  system  and  are 
marked  for  degradation  by  the  attachment  of  multi-Ub 
chains.  Ubiquitylated  proteins  are  degraded  by  the  26S 
proteasome,  an  ^2000  kDa,  multisubunit,  ATP-depen- 
dent  protease  that  consists  of  the  1 9S  complex,  which  is 
required  specifically  for  the  degradation  of  ubiquitylated 
proteins,  and  an  ~700  kDa  complex,  called  the  20S 
proteasome,  which  is  the  ATP-independent  catalytic 
core  of  the  26S  proteasome  (reviewed  by  Peters,  1 994; 
Coux  et  al.,  1996;  Lupas  et  al.  1997). 

The  20S  proteasome  is  universal  among  eukaryotes; 
its  structural  homologs  have  also  been  found  in  archae¬ 
ons  and  eubacteria  (reviewed  by  Coux  et  al.,  1 996).  High- 
resolution  crystal  structures  have  been  reported  for  the 
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20S  proteasomes  of  the  archaeon  Thermoplasma  acido - 
philum  and  the  eukaryote  Saccharomyces  cerevisiae 
(Lowe  et  al.,  1 995;  Groll  et  al.,  1 997).  The  Thermoplasma 
proteasome  contains  two  types  of  subunits,  a  and  p, 
which  form  a  hollow  cylinder  composed  of  four  hepta- 
meric  rings  in  the  configuration  a7p7p7a7.  The  yeast  20S 
proteasome  has  a  similar  structure  but  contains  14  dis¬ 
tinct  subunits,  seven  of  the  a  type  and  seven  of  the  p 
type.  The  N-terminal  threonines  of  the  p  subunits  of 
the  Thermoplasma  proteasome  act  as  nucleophiles  in 
catalyzing  the  hydrolysis  of  peptide  bonds  of  polypep¬ 
tide  substrates  (SeemCiller  et  al.,  1995).  Lys-33  and  Glu- 
17  of  the  p  subunit  also  play  a  role  in  catalyzing  the 
cleavage  of  peptide  bonds.  The  14  identical  p  subunits 
of  the  Thermoplasma  proteasome  form  1 4  identical  ac¬ 
tive  sites,  which  catalyze  the  cleavage  of  substrates 
after  hydrophobic  amino  acid  residues  (chymotrypsin- 
like  active  sites).  These  sites  are  located  on  the  inner 
surface  of  a  central  chamber  formed  by  the  two  rings 
of  p  subunits  (Lowe  et  al.,  1995;  Seemuller  et  al.,  1995). 
p  subunits  are  synthesized  as  inactive  precursors  con¬ 
taining  a  propeptide  that  is  thought  to  be  cleaved  off 
autocatalytically,  yielding  the  mature  p  subunit  bearing 
N-terminal  Thr.  The  propeptides  of  p  subunits  are  not 
required  for  the  in  vitro  assembly  of  the  Thermoplasma 
proteasome  (Seemuller  et  al.,  1996). 

In  eukaryotic  proteasomes,  only  3  of  the  7  distinct  p 
subunits  contain  the  three  conserved  residues  required 
for  activity  of  the  Thermoplasma  proteasome.  Genetic 
and  structural  data  suggest  that  these  three  subunits 
provide  the  active-site  nucleophiles  for  the  three  distinct 
catalytic  activities  of  eukaryotic  proteasomes,  namely 
the  “chymotrypsin-like”  (see  above),  the  “trypsin-like” 
(cleavage  after  basic  residues),  and  the  “peptidylgluta- 
myl  peptide  hydrolyzing  (PGPH)”  activity  (cleavage  after 
acidic  residues).  In  S.  cerevisiae ,  these  three  p  subunits 
are  Pre2p/Doa3p,  Puplp,  and  Pre3p  (Heinemeyer  et  al., 
1993;  Chen  and  Hochstrasser,  1996;  Arendt  and  Hoch¬ 
strasser,  1997;  Groll  et  al.,  1997;  Heinemeyer  et  al., 
1997).  In  the  human  20S  proteasome,  the  related  pro¬ 
teins  are  MB1,  5,  and  Z;  during  the  immune  response, 
these  subunits  can  be  replaced  by  their  respective  ho¬ 
mologs  LMP7,  LMP2,  and  M ECL-1/LM PI  0  (reviewed  by 
Coux  et  al.,  1996).  In  the  case  of  LMP2  and  LMP7,  it 
has  been  demonstrated  that  these  replacements  are 
functionally  relevant  in  altering  the  specificity  of  antigen 
presentation  by  the  MHC  class  I  pathway.  The  catalyti- 
cally  active  p  subunits  of  the  eukaryotic  proteasome  are 
synthesized  with  propeptides,  similarly  to  the  p  subunit 
in  Thermoplasma  (Schmidtke  et  al.,  1997). 

In  the  crystal  structure  of  the  S.  cerevisiae  20S  protea¬ 
some,  the  opening  to  the  proteasome’s  interior,  formed 
by  the  outer  ring  of  the  a  subunits,  is  not  large  enough 
to  admit  even  an  unfolded  polypeptide  chain,  let  alone 
a  folded  protein  (Groll  et  al.,  1997).  The  degradation  of 
larger  substrates  requires  the  19S  complexes,  which 
attach  at  both  sides  of  the  20S  proteasome,  yielding  the 
26S  proteasome.  The  19S  complexes  contain  subunits 
that  bind  multi-Ub  chains,  at  least  one  Ub-specific  iso¬ 
peptidase  that  disassembles  these  chains,  and  several 


Cell 

490 


ATPases  that  are  thought  to  be  involved  in  perturbing 
the  substrate  conformationally  and  guiding  it  to  the  inte¬ 
rior  of  the  20S  proteasome  (Deveraux  et  al.,  1 994;  Hoch- 
strasser  et  al.,  1 995;  Jentsch  and  Schlenker,  1 995;  Coux 
et  al.,  1996). 

Studies  with  mammalian  cells  have  shown  that  the 
20S  proteasome  is  assembled  through  a  15-16S  inter¬ 
mediate,  apparently  a  half-proteasome.  This  intermedi¬ 
ate  contains  all  1 4  a  and  p  subunits,  some  of  which  are 
in  the  precursor  (propeptide-bearing)  form,  and  several 
uncharacterized  polypeptides  as  well  (Frentzel  et  al., 
1994;  Yang  et  al.,  1995;  Nandi  et  al.,  1997;  Schmidtke 
et  al.,  1 997).  Studies  on  the  yeast  20S  proteasome  have 
demonstrated  that  processing  of  proPre2p  (identical  to 
proDoa3p)  is  coupled  to  formation  of  the  20S  protea¬ 
some  from  two  half-proteasome  precursors  (Chen  and 
Hochstrasser,  1996).  Formation  of  the  active  site  capa¬ 
ble  of  autocatalytic  processing  of  proPre2p  depends  on 
the  juxtaposition  of  proPre2p  and  Prel  p  on  the  opposite 
sides  of  the  two  halves  of  the  proteasome.  This  mecha¬ 
nism  is  thought  to  prevent  activation  of  proteolytic  sites 
before  the  central  hydrolytic  chamber  has  been  sealed 
off  from  the  cytosol  through  association  of  the  two 
halves  of  the  proteasome  (Chen  and  Hochstrasser, 
1996).  The  Pre2p  propeptide  is  essential  for  the  forma¬ 
tion  of  functional  proteasomes.  Moreover,  this  propep¬ 
tide  can  operate  in  trans ,  suggesting  that  it  serves  a 
chaperone-like  function  in  proteasome  maturation  (Chen 
and  Hochstrasser,  1996). 

In  the  present  work,  we  identify  Umpl  p,  a  novel  pro¬ 
tein,  as  a  component  of  a  precursor  complex  of  the  20S 
proteasome.  This  precursor  contains  unprocessed  p 
subunits.  Upon  formation  of  the  20S  proteasome  from 
two  such  precursors,  the  propeptides  of  p  subunits  are 
removed,  a  process  that  is  accompanied  by  removal  of 
Umpl  p  from  the  proteasome  through  Umpl  p  degrada¬ 
tion,  which  requires  the  proteasome’s  proteolytic  activ¬ 
ity.  In  umpl  A  cells,  which  lack  Umplp,  coordination  of 
20S  complex  formation  and  processing  of  p  subunits  is 
impaired,  resulting  in  incompletely  or  (in  the  case  of 
Pre2p)  prematurely  processed  p  subunits.  These  find¬ 
ings  reveal  Umpl  p  as  a  novel  type  of  molecular  chaper¬ 
one,  a  short-lived  maturation  factor  required  for  the 
efficient  biogenesis  of  the  20S  proteasome.  We  also 
describe  genetic  evidence  that  the  propeptide  of  pro- 
Pre2p  is  required  for  Umpl  p-dependent  proteasome 
maturation,  and  we  present  a  model  that  accounts  for 
some  of  the  functions  of  Umplp  and  propeptides  in 
proteasome  maturation. 

Results 

The  UMP1  Gene  Is  Required  for 
Ubiquitin-Mediated  Proteolysis 
To  identify  the  genes  required  for  Ub-dependent  prote¬ 
olysis  in  S.  cerevisiae ,  we  screened  for  mutants  defec¬ 
tive  in  the  degradation  of  test  substrates.  One  such 
mutant,  termed  umpl-1  (ub-mediated  proteolysis),  ex¬ 
hibited  defects  in  the  degradation  of  several  normally 
short-lived  proteins  (see  below).  The  complementing 
UMP1  gene  encoded  a  148-residue  (16.8  kDa)  protein. 
Searches  of  the  databases  did  not  identify  close  homo¬ 
logs  of  Umplp  but  did  detect  similarities  to  regions  of 


several  proteins.  One  intriguing  similarity  was  between 
Umpl  p  and  C-terminal  regions  of  the  protease  inhibitor 
contrapsin  (25%  identity,  52%  similarity)  and  related 
proteins  (Figure  1A),  consistent  with  the  likely  role  of 
Umplp  as  an  inhibitor  of  premature  proteolytic  acti¬ 
vation  of  the  proteasome  precursor  complexes  (see 
below). 

Pulse-chase  analyses  in  umpl  A  mutants  revealed  a 
strong  stabilization  of  several  normally  short-lived  test 
proteins,  in  particular  an  N-end  rule  substrate  Arg-p- 
galactosidase  (Arg-p-gal),  which  bears  N-terminal  Arg, 
a  destabilizing  residue  (Varshavsky,  1996),  Ub-Pro-p- 
gal  (a  substrate  of  the  UFD  pathway;  Johnson  et  al., 

1 995;  Varshavsky,  1 997),  and  a2degi-P’gai  (a  substrate  of 
the  DOA  pathway;  Hochstrasser  and  Varshavsky,  1990; 
Hochstrasser  et  al.,  1995)  (Figure  IB).  The  recognition 
and  ubiquitylation  of  these  substrates  involve  different 
recognins  (E3  proteins)  and  different  Ub-conjugating 
(E2)  enzymes.  The  pleiotropic  character  of  the  umpl  A 
phenotype  suggested  that  Umpl  p  functions  downstream 
of  the  recognition  and  ubiquitylation  components  of  the 
Ub  system,  perhaps  at  the  step  of  proteolysis,  or  in 
regulating  the  supply  of  Ub. 

umpl  A  Mutants  Are  Hypersensitive  to  a  Variety 
of  Stresses,  Accumulate  Ub-Protein 
Conjugates,  and  Do  Not  Sporulate 
umpl  A  mutants  grew  more  slowly  than  congenic  wild- 
type  (wt)  cells  at  30°C  or  lower  temperatures  and  were 
severely  growth-impaired  at  higher  temperatures  (37°C) 
(Figure  1C  and  data  not  shown).  In  addition,  they  were 
hypersensitive  to  cadmium  ions  and  to  the  arginine  ana¬ 
log  canavanine  (Figure  1C).  Similar  phenotypes  have 
been  reported  for  mutants  in  genes  encoding  E2  (Ubc) 
enzymes  or  proteasome  subunits  (Heinemeyer  et  al., 
1993;  Jungmann  et  al.,  1993).  Comparisons  of  proteins 
from  whole-cell  extracts  of  wt  and  umpl  A  cells  by  immu- 
noblotting  with  anti-Ub  antibodies  showed  a  dramatic 
accumulation  of  Ub-protein  conjugates  in  the  umpl  A 
mutant.  At  the  same  time,  the  level  of  free  Ub  was  re¬ 
duced  in  umpl  A  cells  (data  not  shown).  Thus,  the  pri¬ 
mary  cause  of  the  umpl  A  phenotype  appears  to  be  the 
impaired  ability  of  umpl  A  cells  to  degrade  Ub-protein 
conjugates.  Homozygous  umpl  A  diploids  (strain  JD61) 
were  unable  to  sporulate  (data  not  shown). 

Umplp  Is  a  Component  of  Proteasome  Precursors 
To  investigate  Umpl  p  biochemically,  an  epitope  tag  was 
linked  to  its  C  terminus.  A  single  copy  of  the  modified 
UMP1  gene  (UMPI-ha),  expressed  from  its  natural  pro¬ 
moter  and  chromosomal  location,  restored  wt  growth 
rates.  We  analyzed  whole-cell  extracts  of  a  strain  ex¬ 
pressing  Umplp-ha  and  Prelp-ha  (the  latter  an  ha- 
tagged  3  subunit  of  the  proteasome)  by  gel  filtration  on 
Superose-6.  Umplp-ha  eluted  in  fractions  22  and  23, 
both  of  which  also  contained  Prel  p-ha  and  other  protea- 
somal  subunits  (Figures  2A,  2B,  and  3A)  but  lacked  the 
chymotrypsin-like  activity  of  the  mature  20S  protea¬ 
some.  The  apparent  size  of  the  Umpl  p-containing  com¬ 
plex  was  300-400  kDa  (Figure  2A).  20S  and  26S  protea¬ 
somes,  as  well  as  free  19S  caps,  and  20S  proteasomes 
with  one  attached  19S  cap,  eluted  in  earlier  fractions 
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Figure  1.  Umplp  Is  Required  for  Ubiquitin- 
Mediated  Proteolysis 

(A)  Sequence  similarities  between  Umplp 
and  C-terminal  parts  of  mouse  contrapsin, 
human  alphal  antitrypsin  inhibitor  {al  act)  and 
mouse  spi-2  (PIR  accession  numbers,  re¬ 
spectively,  JX0129,  A90475,  and  S31305). 
The  sequences  were  aligned  using  the  PileUp 
program  (GCG  package,  version  7.2,  Genet¬ 
ics  Computer  Group,  Madison,  Wl).  Gaps 
(indicated  by  hyphens)  were  used  to  max¬ 
imize  alignments.  Residues  identical  between 
Umpl  p  and  at  least  one  of  the  other  proteins 
are  shaded  in  gray.  Residues  identical  among 
at  least  two  proteins  other  than  Umpl  p  are 
boxed. 

(B)  Pulse-chase  analysis  comparing  the  met¬ 
abolic  stabilities  of  R-p-gal,  Ub-P-p-gal,  and 
MATa21^fl“p-gal  in  wt  ( UMP1 )  and  umpl  A 
cells.  The  open-ended  brackets  denote  the 
position  of  multiubiquitylated  (3-gal  species. 
Asterisks  denote  an  ~90  kDa  (S-gal  cleavage 
product  characteristic  of  short-lived  (5-gal  de¬ 
rivatives  (Dohmen  et  al.,  1991). 

(C)  Growth  of  umpl  A.  (JD59),  UMP1  (JD47- 
13C),  and  umpl  A  cells  transformed  with 
YCp50-LWVfP7  (expressing  wild-type  UMP1). 
Cells  were  streaked  on  YPD  plates  and  incu¬ 
bated  for  2  days  at  30°C  (or  at  37°C  where 
indicated),  with  0.8  ^g/ml  canavanine  or  30 
)im  CdCI2  where  indicated. 


(15-21),  as  determined  by  nondenaturing  gel  electro¬ 
phoresis  (Figure  2B  and  data  not  shown),  by  assays  of 
the  proteasome’s  proteolytic  activity,  and  by  immuno¬ 
logical  detection  of  Cim3p,  a  subunit  specific  for  the 
19S  cap  of  the  26S  proteasome  (Ghislain  et  al.,  1993). 

Cochromatography  of  Umpl  -ha  and  Prel  p-ha  in  frac¬ 
tions  22  and  23  (Figure  2A)  suggested  that  Umplp  is  a 
component  of  proteasome  precursor  complexes.  Con¬ 
sistent  with  this  possibility,  the  Umpl  p-containing  com¬ 
plex  had  a  higher  mobility  than  the  20S  proteasome 
upon  nondenaturing  gel  electrophoresis.  A  proteasome 
precursor  with  an  electrophoretic  mobility  indistinguish¬ 
able  from  that  of  the  Umpl  p-containing  complex  was 
also  detected  in  extracts  from  a  strain  that  expressed 
Puplp-ha,  another  p  subunit  of  the  20S  proteasome 
(Figure  2B). 

To  produce  independent  evidence  bearing  on  the  na¬ 
ture  of  the  Umpl  p-containing  complex,  we  constructed 
a  strain  that  expressed  both  Umpl  p-ha  and  doubly 
tagged  Prel  p-Flag-His6  (Prel  p-FH).  Affinity  chromatog¬ 
raphy  on  Ni-NTA-agarose  and  anti-Flag  antibody  resin 
was  then  used  to  purify  complexes  containing  Prel  p- 
FH.  Umpl  p-ha  cofractionated  with  Prel  p-FH  in  both 
affinity  purification  steps  (Figures  2C  and  2D),  confirm¬ 
ing  that  Umpl  p  is  a  component  of  a  distinct  proteasome- 
related  complex.  This  complex  sedimented  at  M5S  in 
sucrose  gradients  (data  not  shown).  Taken  together, 


this  evidence  strongly  suggested  that  the  Umpl  p-con- 
taining  complex  is  a  precursor  of  the  20S  proteasome 
that  is  similar  to  the  15-16S  precursors  observed  in  the 
maturation  pathway  of  mammalian  proteasomes  (Frent- 
zel  et  al.,  1994;  Yang  et  al.,  1995). 


The  Umplp  Proteasome  Precursor  Complex 
Contains  Unprocessed  p  Subunits 
Several  subunits  of  the  proteasome  are  synthesized  as 
precursors  containing  N-terminal  extensions  (propep¬ 
tides)  that  are  absent  from  the  mature  proteasome. 
These  precursors  are  detected  in  the  mammalian  15-16S 
complex  (Frentzel  et  al.,  1994;  Yang  et  al.,  1995).  To 
compare  the  Umpl  p-containing  complex  with  the  mam¬ 
malian  1 5-1 6S  complex,  we  analyzed  the  former  for  the 
presence  of  propeptide-containing  proteasomal  sub¬ 
units.  These  p  subunits  were  modified  by  the  addition  of 
C-terminal  ha  tags.  Strains  expressing  Pupl  p-ha,  Pre2p- 
ha,  and  Pre3p-ha  grew  at  wt  rates  (data  not  shown).  When 
extracts  of  these  strains  were  fractionated  by  gel  filtra¬ 
tion  on  Superose-6,  the  fractions  containing  the  Umpl  p 
complex  contained  largely  the  p  subunit  precursors, 
proPupl  -ha,  proPre2p-ha,  and  proPre3p-ha  (Figure  3A). 
The  corresponding  mature  p  subunits  largely  eluted  in 
the  earlier  fractions  that  contained  20S  and  26S  protea¬ 
somes  (Figure  3A).  Subtle  but  reproducible  differences 
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Figure  2.  Umpl  p  Is  a  Component  of  Proteasome  Precursor  Com¬ 
plexes 

(A)  An  extract  from  strain  JD127  expressing  Umplp-ha  and  Prelp- 
ha  instead  of  wt  versions  of  these  proteins  was  fractionated  by 
gel  filtration  on  Superose-6.  The  upper  panel  shows  the  results  of 
measurements  of  chymotrypsin-like  activity  in  the  relevant  fractions 
(12-29).  The  last  fraction,  representing  the  void  volume  (v0),  and 
positions  of  the  peaks  of  marker  proteins  are  indicated  by  dashed 
lines.  The  same  fractions  were  analyzed  by  SDS-PAGE  and  immu- 
noblotting  with  anti-Cim3p  and  anti-ha  antibodies,  which  detected 
Cim3p,  Prelp-ha,  and  Umplp-ha,  as  indicated. 

(B)  Analysis  of  Superose-fractionated  20S  proteasomes  and  precur¬ 
sor  complexes  by  nondenaturing  gel  electrophoresis  and  immu- 
noblotting  with  anti-ha  antibody.  The  strains  were  JD1 39  and  JD1 29 
expressing,  respectively,  Puplp-ha  and  Umplp-ha.  Asterisk,  20S 
proteasome  with  one  attached  19S  regulator  cap.  The  26S  protea¬ 
some  eluted  in  fractions  14-17  of  Superose-6  (not  shown). 

(C)  Affinity  purification  of  Prel  p-FH  on  Ni-(NTA)-Sepharose  and  co¬ 
purification  of  Umplp-ha.  Purification  of  proteins  from  an  extract 
of  strain  JD126  expressing  Umplp-ha  and  Prel  p-FH  was  earned 
out  as  described  in  the  Experimental  Procedures.  Different  fractions 
were  analyzed  by  SDS-PAGE  and  immunoblotting  with  anti-Flag 
(upper  panel)  and  anti-ha  antibodies  (lower  panel).  Ce,  crude  extract; 
ft,  flow-through.  In  contrast,  no  binding  of  Umpl  p  to  the  resin  was 
observed  when  extracts  from  strain  JD129  expressing  untagged 
Prel  p  were  used  (not  shown). 

(D)  Affinity  purification  of  Prel  p-FH  on  anti-Flag  antibody  agarose 
resin.  Material  that  was  eluted  from  the  Ni-(NTA)-Sepharose  column 
shown  in  panel  C  was  subjected  to  a  second  affinity  purification 
on  anti-Flag  antibody  resin.  Prel  p-FH  was  specifically  eluted  with 
increasing  concentrations  of  the  Flag  peptide.  W,  wash. 


were  detected  among  the  tagged  0  subunits  with  re¬ 
spect  to  the  distribution  of  their  precursor  and  pro¬ 
cessed  forms  in  various  fractions.  For  example,  pro- 
Puplp-ha  was  detectable  only  in  fractions  22  and  23, 
and  was  absent  from  fraction  21 .  By  contrast,  proPre2p- 
ha  was  detectable  in  the  larger  complexes  (down  to 
fraction  1 9),  while  fractions  22  and  23  already  contained 
some  processed  Pre3p-ha.  These  patterns  suggested 
a  defined  order  of  processing  events.  One  conclusion 
from  these  experiments  is  that  the  Umpl  p-containing 
complex  is  a  precursor  of  the  20S  proteasome  that  con¬ 
tains  unprocessed  0  subunits  and  is  therefore  proteolyti- 
cally  inactive  (Figures  3A  and  3C). 

Umplp  Is  Required  for  Correct 
Proteasome  Maturation 

Next,  we  asked  what  effect  the  umpl  A  mutation  has  on 
maturation  and  activity  of  the  proteasome.  Specifically, 
the  analyses  described  above  were  repeated  with  ex¬ 
tracts  obtained  from  umpl  A  cells.  Figure  3C  shows  that 
there  was  a  significant  reduction  in  the  three  proteolytic 
activities  of  the  proteasome  in  the  fractions  containing 
the  20S  and  26S  proteasomes  (fractions  14-21).  The 
reduction  of  the  specific  activity  appears  to  be  partially 
compensated  by  an  increased  expression  of  protea¬ 
somes  (Figures  3A  and  3B,  data  not  shown)  similar  to 
that  observed  in  cells  expressing  mutant  0  subunits 
(Arendt  and  Hochstrasser,  1997).  In  addition,  a  new  (ab¬ 
sent  from  wt  cells)  peak  of  chymotrypsin-like  activity 
encompassing  fractions  22  and  23  was  detected  in  ex¬ 
tracts  from  umpl  A  cells.  These  were  the  fractions  that 
contained  the  Umplp  complex  from  wt  extracts  (see 
above  and  Figure  3A),  suggesting  that  the  absence  of 
Umplp  from  the  proteasome  precursor  complex  was 
the  cause  for  premature  activation  of  its  chymotryptic 
activity  in  the  umpl  A  mutant.  To  verify  that  the  detected 
activity  was  actually  of  the  ~15S  precursor  complex 
from  umpl  A  cells,  the  fraction  22  samples  from  strains 
expressing  either  wt  Pre2p  or  Pre2p-ha  were  incubated 
with  anti-ha  antibody  and  protein  A  Sepharose.  This 
treatment  did  not  deplete  the  chymotryptic  activity  from 
the  sample  containing  untagged  Pre2p  but  did  deplete 
-.,80%  of  the  activity  from  an  otherwise  identical  sample 
derived  from  umpl  A  cells  expressing  Pre2p-ha  (data  not 
shown).  Thus,  the  chymotryptic  activity  of  this  fraction 
resided  in  the  proteasome  precursor  complexes. 

To  follow  the  appearance  of  the  precursor  and  mature 
forms  of  0  subunits  in  proteasome  precursors  and  ma¬ 
ture  proteasomes,  we  analyzed  extracts  from  umpl  A 
cells  expressing  ha-tagged  versions  of  these  subunits 
by  immunoblotting  with  anti-ha  antibody.  The  process¬ 
ing  of  the  three  analyzed  0  subunits,  Puplp-ha,  Pre2p- 
ha,  and  Pre3p-ha,  was  strikingly  different  in  umpl  A  and 
wt  cells  (Figure  3A).  For  all  three  subunits,  a  dramatic 
increase  of  their  precursors  was  detected  in  the  frac¬ 
tions  (1 4-21 )  that  contained  the  20S  and  26S  forms  of  the 
proteasome.  In  addition,  different  processed  variants, 
possibly  representing  processing  intermediates  of  pro- 
Pre2p,  could  be  detected  in  fractions  22  and  23,  which 
contained  the  ~1 5S  proteasome  precursor,  and  in  some 
of  the  proteasome-containing  fractions  as  well  (Figure 
3A).  In  contrast,  with  the  exception  of  Pre3p,  almost 
no  processed  0  subunits  were  present  in  the  fractions 


Proteasome  Maturation  Factor  Umpl  p 
493 


k  UMP1  umpl  A 

1213  1415  16  17  16  19  20  21  22  23  24  25  26  2728  29  fraction  12  13  14  15  16  17  ta  19  Z)  21  22  23  2*  25  26  27  2B  » 


^fejM-proPupIp-ha- 
Puplp-ha  - 

-  proPre2p-ha~| 

-  Pre2p-ha 


proPre3p-ha  - 
Pre3p-ha  - 


-Umplp-ha 


B 

UMP1  umpl  A 

_ Pupl  p-ha _ 


■  sV 

-p- 

my  vm 

-m- 

0  10  20  30 

0  10  20  30 

Pre2p-ha 


-- —  -T-  r-rr  - — 

_p- 

xW. 

* 

-m- 

0  10  20  30  0  10  20  30 


0  10  20  30  0  10  20  30 

chase  time  (min) 


fraction 


Figure  3.  Umpl  p  Is  Required  for  the  Correct 
Processing  of  Proteasomal  p  Subunits 

(A)  SDS-PAGE  and  anti-ha  immunoblot  analy¬ 
ses  of  extracts  fractionated  on  Superose-6. 
The  congenic  yeast  strains  used  were  JD1 31 , 
JD132,  JD133,  JD134,  JD135,  and  JD136 
(Table  1).  Western  blots  of  the  umpl  A  ex¬ 
tracts  were  developed  for  a  shorter  time  to 
compensate  for  an  increased  expression  of 
proteasomal  subunits  in  the  mutant.  Aster¬ 
isks,  processing  intermediates  or  incom¬ 
pletely  processed  forms  of  Pre2p-ha  that  were 
observed  in  the  umpl  A  mutant  but  not  in  wt 
cells. 

(B)  Pulse-chase  analysis  comparing  the  rate 
of  p  subunit  processing  in  wt  cells  to  that  in 
umpl  A  cells  (same  strains  as  in  A).  Cells  were 
pulse-labeled  with  “S-Met/Cys  for  5  min.  Pro¬ 
teins  were  precipitated  with  anti-ha  antibody. 
Fluorographic  exposures  were  ~3  times 
longer  for  the  UMP1  samples.  P,  propeptide- 
containing  precursor  form;  m,  mature  form; 
asterisks,  as  in  (A);  dots,  nonspecific  bands 
present  in  some  samples. 

(C)  Comparison  of  proteolytic  activities  in  ex¬ 
tracts  with  equal  amounts  of  total  protein  of 
UMP1  (JD127)  and  umpl  A  cells  (JD75)  frac¬ 
tionated  by  gel  filtration  on  Superose-6.  The 
profile  of  the  chymotrypsin-like  activity  in 
UMP1  is  the  same  as  in  Figure  2A. 


containing  the  ^15S  proteasome  precursor  from  wt 
cells,  and  no  processing  intermediates  of  Pre 2p  could 
be  detected  either  (Figure  3A).  These  findings  indicate 
that  Umpl  p  is  required  to  prevent  premature  processing 
of  at  least  proPre2p  and  that  the  presence  of  Umpl  p  is 
important  for  the  coordination  of  proteasome  assembly 
and  subunit  processing. 

A  defect  of  umpl  A  cells  in  the  processing  of  p  subunits 
was  also  observed  by  following  the  metabolic  fate  of 
these  subunits  in  pulse-chase  experiments.  In  wt  cells, 
most  of  the  propeptide-containing  forms  of  p  subunits 
(proPupl  -ha,  proPre2p-ha  and  proPre3p-ha)  were  con¬ 
verted  into  their  mature  counterparts  during  the  30-min 
chase  (Figure  3B).  By  contrast,  in  the  umpl  A  mutant,  the 
bulk  of  the  p  subunit  precursors  remained  unprocessed 
during  this  time,  a  finding  consistent  with  the  results  of 
immunoblot  analyses  (Figure  3A).  In  the  case  of  Pre2p- 
ha,  pulse-chase  analysis  again  revealed  the  species  of 
intermediate  size  (putative  processing  intermediates)  in 
extracts  from  umpl  A  cells.  These  species  were  absent 
from  the  equivalent  samples  derived  from  wt  cells  (Fig¬ 
ure  3B). 

Umplp  Is  Degraded  upon  Formation 
of  the  20S  Proteasome 

The  experiments  above  demonstrated  that  Umpl  p  is  a 
component  of  proteasome  precursors  that  is  absent 
from  the  mature  20S  proteasome.  One  possibility  was 


that  Umpl  p  leaves  the  precursor  complex  upon  forma¬ 
tion  of  the  20S  proteasome  from  the  two  5S  precur¬ 
sors  and  may  then  assist  with  another  round  of  protea¬ 
some  assembly,  thus  acting  catalytically.  It  was  also 
possible  that  formation  of  the  20S  proteasome  sterically 
traps  Umplp.  Since  the  formation  of  the  20S  form  of 
the  proteasome  coincides  with  the  appearance  of  its 
proteolytic  activities  (Frentzel  et  al.,  1994;  Chen  and 
Hochstrasser,  1996),  Umplp  might  be  degraded  by  the 
newly  formed  20S  proteasome.  Pulse-chase  analysis  of 
Umplp-ha  in  wt  cells  showed  that  Umplp-ha  is  indeed 
degraded  in  vivo,  the  rate  of  its  degradation  being  similar 
to  the  rate  of  disappearance  of  proPupIp-ha,  which  is 
converted  to  Puplp-ha  upon  maturation  of  the  protea¬ 
some  (compare  Figures  4A  and  3B). 


Umplp  Is  Stabilized  in  Proteasome  Mutants  and 
Can  Be  Detected  Inside  the  20S  Particle 
If  Umplp  becomes  a  substrate  of  the  newly  formed 
20S  proteasome,  the  degradation  of  Umpl  p  should  be 
inhibited  by  mutations  that  affect  the  proteasome’s  pro¬ 
teolytic  activities.  We  used  pulse-chase  assays  to  follow 
the  metabolic  fate  of  Umplp-ha  in  the  pre 7- 1  mutant, 
which  is  known  to  be  deficient  in  the  chymotryptic  activ¬ 
ity  of  the  proteasome  (Heinemeyer  et  al.,  1 993).  Umpl  p 
was  partially  stabilized  in  prel-1  cells,  as  indicated  by 
the  increased  amount  of  ^S-labeled  Umplp  in  prel-1 
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extracts  versus  wt  extracts  that  contained  the  same  total 
amount  of  TCA-precipitable  35S  (Figure  4A).  However,  we 
still  observed  a  significant  decrease  of  pulse-labeled 
Umplp  upon  increasing  chase  times  when  the  immuno- 
precipitation  was  carried  out  under  nondenaturing  con¬ 
ditions  (Figure  4A).  By  contrast,  when  the  extracts  were 
treated  with  0.4%  SDS  at  1 00°C  before  immunoprecipi¬ 
tation,  virtually  no  decrease  of  the  Umpl  p  signal  during 
the  chase  was  observed  in  prel-1  cells.  These  results 
were  consistent  with  the  possibility  that  the  newly 
formed  Umplp-ha  became  inaccessible  to  the  anti-ha 
antibody  during  the  chase  because  it  became  trapped 
within  the  newly  formed  prel-1  proteasome.  If  so, 
Umpl  p  was  expected  to  be  present  in  fractions  from 
the  Superose-6  column  that  corresponded  to  the  20S 
and  26S  proteasomes  in  the  prel-1  mutant.  Indeed, 
whereas  in  wt  (PRE1)  cells  Umplp-ha  was  detected  by 
SDS-PAGE  and  immunoblotting  only  in  fractions  22  and 
23  (corresponding  to  the  5S  proteasome  precursor 
complex),  in  the  mutant  {prel-1 )  cells  Umplp-ha  was 
detected  in  fractions  15-23,  indicating  that  it  was  also 
present  in  mature  proteasomes  (Figure  4B).  We  assayed 
the  accessibility  of  Umplp-ha  to  anti-ha  antibody  by 
immunoprecipitation  from  fraction  19  (the  20S  protea¬ 
some)  of  extracts  from  the  prel-1  mutant  and  from  frac¬ 
tion  22  (the  ~1 5S  proteasome  precursor)  of  extract  from 
PRE1  cells.  Umplp-ha  could  be  immunoprecipitated 
from  either  fraction  19  or  22  following  pretreatment  with 
0  1  %  SDS,  but  not  under  nondenaturing  conditions  (Fig¬ 
ure  4C),  indicating  that  the  ha  tag  was  inaccessible  to 
the  anti-ha  antibody  in  both  the  prel-1  proteasome  and 
the  precursor  complex.  However,  when  otherwise  iden¬ 
tical  assays  were  carried  out  with  a  polyclonal  anti- 
Umpl  p  antiserum,  most  of  Umpl  p  could  be  immunopre¬ 
cipitated  under  nondenaturing  conditions  from  fraction 
22  (the  rsj  15S  proteasome  precursor)  but  was  still  not 
immunoprecipitated  from  fraction  19  (the  20S  prel-1 
proteasome).  In  contrast,  when  polyclonal  anti-protea- 
some  antibodies  were  used,  Umplp  was  precipitated 
quantitatively  along  with  both  complexes  (Figure  4C). 

These  results  indicated  that  a  part  of  Umplp  other 
than  its  C  terminus  is  accessible  to  anti-Urnpl  p  antibod¬ 
ies  in  the  ~15S  precursor.  However,  Umplp  becomes 
entirely  inaccessible  (under  nondenaturing  conditions) 
upon  formation  of  the  20S  proteasome  from  two  -15S 
precursors.  This  interpretation  was  supported  by  exam¬ 
ining  the  sensitivity  of  Umpl  p-ha  in  different  complexes 
to  trypsin  digestion  (Figure  4D).  Specifically,  Umplp-ha 
in  the  20S  proteasomes  (fraction  19)  from  prel-1  cells 
was  completely  protected  against  trypsin.  In  contrast, 
Umpl  p-ha  in  the  -1 5S  proteasome  precursors  was  de- 
tectably  accessible  to  trypsin.  The  degradation  of  Umpl  p 
was  incomplete,  resulting  in  a  protected  fragment  of 
Umpl  p-ha  that  lacked  the  -5  kDa  N-terminal  region  but 
retained  the  C-terminal  ha  tag  (Figure  4D).  Interestingly, 
in  a  similar  experiment  with  a  strain  expressing  pro- 
Pre2p-ha,  we  observed  that,  in  the  Umpl -containing 
~1 5S  proteasome  precursor  complex  (fraction  22),  pro- 
Pre2p-ha  was  shortened  by  trypsin  treatment  to  yield  a 
product  whose  electrophoretic  mobility  was  indistin¬ 
guishable  from  that  of  the  natural  mature  Pre2p-ha  (Fig¬ 
ure  4D).  In  these  experiments,  the  overall  structure  of 
the  Umpl  p  proteasome  precursor  complex  remained 
intact  as  judged  by  native  gel  analysis  of  trypsin-treated 
material  (data  not  shown). 
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Figure  4.  Umpl  p  Is  Stabilized  in  prel-1  Mutants  Cells  that  Are  De¬ 
fective  in  the  Proteasome's  Chymotrypsin-like  Activity  and  Persists 
in  20S  Proteasomes 

(A)  Pulse-chase  analysis  of  Umplp-ha  in  wt  and  prel-1  cells.  The 
strains  used  were  JD150  (PRE1)  and  JD151  (pre J-T)  (Table  1),  both 
expressing  Umplp-ha.  The  cells  were  pulse-labeled  with  S-Met/ 
Cys  for  5  min.  Extracts  were  prepared  from  samples  taken  at  differ¬ 
ent  chase  times.  Samples  were  then  split  in  halves.  One-half  (-SDS) 
was  subjected  to  immunoprecipitation  with  anti-ha  antibody  ac- 
cording  to  the  standard  pulse-chase  protocol.  The  other  half  (+SDS) 
was  adjusted  to  0.4%  SDS  and  incubated  at  100“C  for  5  min,  then 
diluted  with  extraction  buffer  to  the  final  concentration  of  0.1  %  SDS 
prior  to  immunoprecipitation. 

(B-D)  SDS-PAGE  and  anti-ha  immunoblot  analyses.  (B)  Detection 
of  Umpl  p-ha-containing  complexes  in  extracts  fractionated  by  Su¬ 
perose-6  gel  filtration.  Strains  were  the  same  as  in  (A).  Note  the 
accumulation  of  Umplp-ha  in  fractions  containing  the  20S  and 
26S  proteasome  (fractions  15-21)  in  prel-1.  (C)  Immunoblot  analy¬ 
ses  of  immunoprecipitations  were  carried  out  with  matenal  from 
Superose-6  fraction  19  of  extracts  from  strain  JD151  and  fraction 
22  of  extracts  from  strain  JD150,  using  the  indicated  antibodies. 
Immunoprecipitations  were  performed  without  addition  of  SDS 
(-SDS)  or  after  boiling  in  the  presence  of  0.1%  SDS  (+SDS).  P, 
precipitates;  S,  supernatants.  (D)  Assaying  trypsin  sensitivity  of  the 
same  material  as  in  (C),  and  of  the  Superose-6  fraction  22,  from  strain 
JD138  expressing  Pre2p-ha.  P,  proPre2p-ha;  m,  mature  Pre2p-ha, 
asterisks  mark  trypsin  digestion  products. 


In  the  umpl  A  Mutant,  the  Propeptide  of  Pre2p 
Is  Not  Required  for  Incorporation 
of  Pre2p  into  the  Proteasome 

Chen  and  Hochstrasser  (1996)  have  elegantly  demon¬ 
strated  that  the  propeptide  of  Pre2p,  if  separated  from 
the  mature  Pre2p,  could  function  in  trans  and  thereby 


Proteasome  Maturation  Factor  Umpl  p 
495 


/■ 


A  viability 


1 

i  ProlT  . 

Pre2p 

_J  /UMP1 

+  (wt) 

2  0 

Pro! 

EZ 

Pre2p 

1  /uMPI 

+ 

3  [ 

Pro] 

EZ 

Pre2p 

1  / umpl  A 

+ 

4 

EZ 

Pre2p 

| 

n 

- 

5 

EZ 

Pre2p 

1  j umpl  A 

+ 

Figure  5.  The  Pre2p  Propeptide  Is  Not  Essential  in  the  umpA  Mutant 

(A)  Representation  of  different  genotypes  of  a  set  of  congenic  strains 
and  their  effects  on  cell  viability.  Strain  1  (JD50-1  OB)  is  the  wt  control. 
Strain  2  (MHY952)  expresses  Pre2p/Doa3p-ALS  with  N-terminal  Thr 
(T)  as  a  fusion  to  ubiquitin  (not  shown),  and  the  Pre2p/Doa3p  propep¬ 
tide  (LS)  from  two  separate  plasmids  (Chen  and  Hochstrasser,  1 996). 
Strain  3  (JD160),  same  as  strain  2  but  umpl  A.  Strain  4  would  be 
derived  from  strain  2  through  the  loss  of  the  plasmid  encoding 
Pre2p/Doa3p-LS  that  operates  in  trans.  We  were  unable  to  produce 
isolates  that  lost  this  plasmid,  confirming  the  earlier  demonstration 
that  the  propeptide  is  essential  for  cell  viability  (Chen  and  Hochstras¬ 
ser,  1996).  Strain  5  (JD1 63)  is  derived  from  strain  3  through  the  loss 
of  the  propeptide-encoding  plasmid.  Loss  of  the  plasmid  did  not 
affect  cell  viability  in  the  umpl  A  background. 

(B)  Induction  of  UMP1  expression  in  the  umpl  A  background  inhibits 
the  growth  of  a  strain  that  lacks  the  Pre2p  propeptide.  Strains  3 
(plus)  and  5  (minus)  shown  in  (A)  were  transformed  with  pJD429  that 
expressed  UMP1  from  the  galactose-inducible,  glucose-repressible 
Pg>u.t  promoter,  or  with  an  empty  vector  as  a  control.  Transformants 
selected  on  glucose  media  were  pregrown  on  selective  media  con¬ 
taining  raffinose  as  a  carbon  source,  then  streaked  onto  selective 
media  with  2%  galactose  or  2%  glucose,  and  grown  for  3  days  at 
30°C. 


still  allow  the  incorporation  of  Pre2p  into  the  20S  protea¬ 
some.  Under  these  conditions,  expression  of  the  pro¬ 
peptide  is  essential  for  cell  viability,  suggesting  that  the 
propeptide,  in  c/s  or  at  least  in  transt  is  required  for  the 
assembly  of  an  active  proteasome  (Chen  and  Hochstras¬ 
ser,  1996). 

We  confirmed  this  result  using  their  strain  MHY952, 
in  which  mature  Pre2p  and  the  propeptide  are  expressed 
on  separate  plasmids  (Figure  5A).  Specifically,  under 
nonselective  growth  conditions  this  strain  did  not  yield 
viable  cells  that  had  lost  the  plasmid  expressing  the 


propeptide.  Surprisingly,  however,  when  we  constructed 
and  examined  a  congenic  umpl  A  derivative  of  this 
strain,  we  noticed  that  it  lost  the  propeptide-expressing 
plasmid  at  a  frequency  suggesting  that  this  plasmid  did 
not  provide  a  significant  growth  advantage  to  cells  (data 
not  shown).  Indeed,  growth  rates  of  cells  with  the  plas¬ 
mid  were  indistinguishable  from  those  lacking  it  (Figure 
5B).  This  result  demonstrated  that  umpl  A  is  a  suppres¬ 
sor  of  a  deletion  of  the  propeptide  of  Pre2p.  If  so,  expres¬ 
sion  of  UMP1  in  the  (umpl A  PRE2-Apro )  background 
should  be  incompatible  with  cell  viability.  This  prediction 
was  confirmed  when  we  examined  growth  properties 
of  the  (umpl A  PRE2-Apro)  strain  transformed  with  a 
plasmid  expressing  UMP1  from  the  galactose-induci¬ 
ble  PGAU  promoter.  On  glucose-containing  media,  the 
growth  rate  of  this  transformant  was  indistinguishable 
from  that  of  an  otherwise  identical  transformant  carrying 
the  propeptide-expressing  plasmid.  In  contrast,  on  ga¬ 
lactose-containing  media,  the  (umpl A  PRE2-Apro)  mu¬ 
tant  containing  the  Pgau-UMPI  plasmid  was  unable  to 
grow,  whereas  the  control  strains  grew  (Figure  5B).  This 
result  indicated  that  the  propeptide  of  Pre2p  is  essential 
for  the  Umpl  p-assisted  maturation  of  the  proteasome 
but  is  not  essential  for  (partially)  defective  maturation 
of  the  proteasome  that  takes  place  in  the  absence  of 
Umpl  p. 


Discussion 

Umplp,  a  Novel  Maturation  Factor 
of  the  20S  Proteasome 

We  describe  the  discovery  of  a  proteasome  maturation 
factor,  termed  Umpl  p,  whose  unique  properties  include 
a  short  in  vivo  half-life  that  is  due  to  its  degradation 
within  the  newly  formed  proteasome.  Umpl  p  is  required 
for  coordination  of  the  proteasome’s  physical  assembly 
and  enzymatic  activation.  In  addition,  the  normally  es¬ 
sential  propeptide  of  the  Pre2p  p  subunit  was  found 
to  become  nonessential  in  the  absence  of  Umpl  p.  We 
report  the  following  specific  results, 
umpl  A  Mutants  Are  Defective  in 
Ub-Mediated  Proteolysis 

They  are  sensitive  to  a  variety  of  stresses  and  accu¬ 
mulate  Ub-protein  conjugates  (Figure  1  and  data  not 
shown).  All  of  the  observed  phenotypes  of  umpl  A  cells 
are  consistent  with  the  conclusion  that  umpl  mutants 
are  impaired  in  proteasome  biogenesis  and,  conse¬ 
quently,  in  the  degradation  of  ubiquitylated  proteins. 
The  UMP1  Gene 

It  encodes  a  polypeptide  with  a  calculated  molecular 
mass  of  16.8  kDa.  No  close  sequence  homologs  of 
Umpl  p  were  detected  in  the  current  databases.  How¬ 
ever,  the  presence  of  a  small  protein  similar  in  size  to 
Umplp  in  preparations  of  the  half-proteasome  precur¬ 
sors  in  mammals  (Frentzel  et  a!.,  1 994;  Yang  et  al.,  1 995; 
Nandi  et  al.,  1997;  Schmidtke  et  al.,  1997)  suggests  the 
presence  of  a  functional  homolog  of  Umpl  p  in  the  mam¬ 
malian  proteasome  maturation  pathway. 

Umplp  Is  a  Component  of  Proteasome  Precursors 
that  Contain  Unprocessed  p  Subunits 
The  Umpl  p-containing  complex  has  a  molecular  mass 
of  300-400  kDa,  sediments  at  ~15S,  migrates  signifi¬ 
cantly  faster  in  native  gels  than  the  20S  proteasome, 
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Figure  6.  A  Model  of  the  Umplp  Function  in 
Proteasome  Maturation 
Shown  is  a  schematic  view  of  the  20S  protea¬ 
some  and  its  precursor  forms,  with  the  a  and 
p  subunits  as  blue  and  green  balls,  respec¬ 
tively.  Three  of  the  p  subunits  are  drawn  with 
extensions  that  represent  propeptides.  The  p 
subunits  in  the  front  are  drawn  transparent  in 
order  to  allow  a  view  into  the  interior  chamber 
of  the  proteasome.  Structure  A  is  a  protea¬ 
some  precursor  complex  (half  proteasome), 
characterized  by  the  presence  of  Umpl  p  and 
unprocessed  p  subunits.  In  step  1,  two  of 
these  precursors  join  to  build  structure  B,  a 
step  that  leads  to  conformational  or  posi¬ 
tional  shifts  of  Umpl  p  and  propeptides.  Con¬ 
formational  changes  of  the  propeptides  trig¬ 
ger  their  autocatalytic  processing  (step  2), 
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and  has  a  subunit  composition  that  is  highly  similar  to 
that  of  the  20S  proteasome,  as  determined  by  SDS- 
PAGE  and  immunoblot  analyses  with  anti-20S  protea¬ 
some  antibodies  (Figure  2  and  data  not  shown).  Taken 
together,  these  results  indicate  that  the  Umpl  p  is  con¬ 
tained  within  a  half-proteasome  precursor  complex, 
whose  counterpart  has  been  described  in  the  maturation 
pathway  of  mammalian  proteasomes  (Frentzel  et  al., 
1994;  Yang  et  al.,  1995;  Nandi  et  al.,  1997;  Schmidtke 
etal.,  1997). 

Maturation  of  Puplp,  Pre2p,  and  Pre3p  Is  Strongly 
Impaired  in  the  umpl  A  Mutant 

Considerably  increased  amounts  of  the  unprocessed 
(propeptide-containing)  forms  of  these  p  subunits  were 
detected  in  the  20S  and  26S  fractions  from  umpl  A  cells 
(Figure  3).  In  addition,  the  proteasomes  assembled  in 
umpfA  cells  had  reduced  activity.  Thus,  Umplp  is  re¬ 
quired  for  the  correct  and  efficient  maturation  of  the 
proteasomes. 

Partially  Processed  Forms  of  Pro-Pre2p  Are  Detected 
in  the  Proteasome  Precursor  Complex  and  the 
Proteasome  of  umpl  A  Cells  but  Not  in  WT  Cells 
These  incompletely  and  prematurely  processed  forms 
of  Pre2p  (Figures  3A  and  3B)  appear  to  underlie  a  chy- 
motrypsin-like  activity  that  was  associated  with  the  pre¬ 
cursor  complex  in  umpl  A  cells  but  was  absent  from  the 
same  complex  of  wt  cells  (Figure  3C).  This  was  sug¬ 
gested  by  the  observation  that  this  activity  was  not  in¬ 
hibited  by  treatment  with  lactacystin  and  was  absent 
from  cells  lacking  the  Pre2p  propeptide  (P.  C.  R.  and 
R.  J.  D.,  unpublished  data).  These  findings  indicated 
that  Umplp  has  a  dual  role  in  proteasome  maturation. 
Specifically,  Umplp  prevents  premature  processing  of 
proPre2p  in  the  precursor  complex  and  is  also  required 
for  the  correct  maturation  of  active  sites  upon  assembly 
of  the  proteasome  (see  below). 

Umplp  Is  Degraded  by  the  Newly 
Formed  Proteasome 

The  kinetics  of  the  rapid  degradation  of  Umpl  p  is  similar 
to  the  kinetics  of  processing  of  Puplp  (Figures  3B  and 
4A),  suggesting  that  Umpl  p  is  destroyed  by  the  protea¬ 
some  upon  its  formation  from  the  two  half-proteasome 
precursors  and  the  accompanying  maturation  of  active 


sites  (Chen  and  Hochstrasser,  1996).  This  conclusion 
was  strongly  supported  by  the  observation  that  Umpl  p 
is  significantly  stabilized  in  the  prel-1  mutant,  which  is 
deficient  in  the  proteasome’s  chymotrypsin-like  activity 
(Figure  4A).  Specifically,  in  contrast  to  the  pattern  in  wt 
cells,  Umplp  was  detectable  in  the  20S  and  26S  forms 
of  the  prel-1  proteasome.  Umplp  in  these  complexes 
was  shielded  both  from  detection  by  antibodies  and 
from  digestion  by  trypsin  (Figures  4B-4D).  These  and 
related  data  suggested  a  model  in  which  Umplp  is  en¬ 
cased  within  the  20S  proteasome  upon  its  assembly 
from  the  two  Umpl  p-containing  half-proteasome  pre¬ 
cursor  complexes  (Figure  6).  Formation  of  the  20S  struc¬ 
ture  triggers  active  site  maturation,  resulting  in  the  deg¬ 
radation  of  Umplp. 

The  Propeptide  of  Pre2p  Is  Not  Essential 
in  umpl  A  Cells 

Chen  and  Hochstrasser  (1 996)  demonstrated  that  the 
propeptide  of  Pre2p  is  essential  for  cell  viability  and  that 
this  propeptide  can  operate  in  trans.  They  concluded 
that  the  propeptide,  in  addition  to  rendering  proteasome 
precursors  proteolytically  inactive,  serves  a  chaperone¬ 
like  function  required  for  the  correct  incorporation  of 
Pre2p  into  the  proteasome.  One  striking  result  of  the 
present  work  is  that  the  propeptide  of  Pre2p  becomes 
dispensable  in  the  umpl  A  mutant  (Figure  5).  In  fact, 
the  presence  of  the  propeptide,  which  is  essential  for 
viability  of  wt  cells,  does  not  provide  a  growth  advantage 
to  umpl  A  cells  that  express  it. 

On  the  Functions  of  Umplp  and  p  Subunit 
Propeptides  in  Proteasome  Maturation 
Our  results  suggested  a  model  illustrated  in  Figure  6. 
This  model  is  based  in  part  on  the  idea  described  by 
Chen  and  Hochstrasser  (1996) -that  the  active  sites  of 
the  proteasome  are  formed  upon  its  assembly  from  two 
half-proteasome  precursors,  through  a  juxtaposition  of 
subunits  at  the  interface  of  the  dyad-related  halves.  The 
advantage  of  this  mechanism  is  that  autocatalytic  matu¬ 
ration  of  the  active  sites  (through  processing  of  the  rele¬ 
vant  p  subunits)  is  coupled  to  the  assembly  of  the  pro¬ 
teasome,  thus  avoiding  premature  processing  of  the 
propeptides  of  p  subunits.  This  view  was  supported  by 
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Table  1 .  Yeast  Strains 


Strain 


YPH500 

JDC9-11 

JD47-13C 

JD59 

JD53 

JD81-1A 

JD75 

JD126 

JD127 

JD129 

JD131 

JD132 

JD133 

JD134 

JD135 

JD136 

JD138 

JD139 

BBY45 

JD61 

JD50-10B 

MHY952 

JD160 

JD163 

WCG4a 

YHI29/1 

JD150 

JD151 


Relevant  Genotype 

MAT  a  ade2-101  his3-&200  Ieu2-M  Iys2-801  trp1-±63  ura3-52 
MATa  umpl-1 

MATa  his3-&200  leu2-3, 1 12  Iys2-801  trp1^63  ura3-52 

MATa  ump1-M::HIS3 

MATa 

MATa  ump1-M::HIS3 

MATa  PRE1  -ha::Ylplac21 1  ump1-M::HIS3 

MATa  UMP 1  -ha::Ylplac128  PRE1-Flag-6Hi$::Ytplac211 

MATa  UMP  1  -ha::Ylplac  128  PRE1-ha::Ylplac21 1 

MATa  UMP1-ha::Ylptac  128  ' 

MATa  UMP  1  -ha::Ylplac  128  PUP1-ha::Ylplac211 
MATa  ump1-M::HIS3  PUP1-ha::Ylpiac211 
MATa  UMP  1  -ha::Ylplac  1 28  PRE2-ha::Ylplac21 1 
MATa  ump1-M::HIS3  PRE2-ha::Ylpfac21 1 
MATa  UMP1-ha::Ylplac128  PRE3-ha::Ytplac21 1 
MATa  ump1-M::HIS3  PRE3-ha::Ylplac21 1 
MATa  PRE2-ha::Ylplac21 1 
MATa  PUP1-ha::Ylplac211 

MATa  his3-A200  Ieu2-3,112  Iys2-801  trpl-1  ura3-52 
MATa/ MATa  ump  1-M ::HIS3/ump  1-h2::LEU2 
MATa  ieu2-3, 1 12  his3-k200  trpl-1  ura3-52 
MATa  teu2-3, 1 12  his3-A200  trpl-1  ura3-52 
pre2(=doa3)M ::HIS3  (YCpUbD0A3ALS-His)  (YEpDOAS^) 
MATa  ump1-&3 

pre2(~doa3)M::HIS3  (YCpUbD0A3ALS-His)  (YEpD0A3LS) 
MATa  ump1-/±3 

pre2(=doa3)-M::HIS3  (YCpUbD0A3ALS-His) 

MATa  ura3  his3-11  Ieu2-3,112 
MATa  prel-1  ura3  his3-11  Ieu2-3,112 
MATa  UMP1-ha::Ylplac128 

MATa  prel-1  UMP1-ha::Ylplac128  _ _ _ 


Source/Comment 

Sikorski  and  Hieter,  1989 
Derivative  of  YPH500 
Dohmen  et  a!.,  1995 
Derivative  of  JD47-13C 
Derivative  of  JD47-13C 
Derivative  of  JD53 
Derivative  of  JD47-13C 
Derivative  of  JD47-13C 
Derivative  of  JD71 
Derivative  of  JD127 
Derivative  of  JD1 29 
Derivative  of  JD59 
Derivative  of  JD1 29 
Derivative  of  JD59 
Derivative  of  JD1 29 
Derivative  of  JD59 
Derivative  of  JD47-1 3C 
Derivative  of  JD47-13C 
Bartel  et  al.,  1990 
Congenic  with  BBY45 
Congenic  with  BBY45 
Chen  and  Hochstrasser,  1996, 
congenic  with  BBY45 
Derivative  of  MHY952 

Derivative  of  JD160, 
cured  of  YEpDOA3LS 
Heinemeyer  et  al.,  1993 
Heinemeyer  et  al.,  1993 
Derivative  of  WCG4a 
Derivative  of  YH129/1 


biochemical  analyses  of  the  proteasome  maturation  in 
mammalian  cells  (Frentzel  et  al.,  1994;  Yang  et  al.,  1995; 
Nandi  et  al.,  1997;  Schmidtke  et  al.,  1997). 

The  results  of  the  present  work  further  support  this 
model.  In  addition,  we  discovered  that  a  novel  factor, 
Umpl  p,  is  required  for  autocatalytic  active  site  matura¬ 
tion  and  proteasome  assembly  to  occur  in  a  coordinated 
fashion.  A  model  that  accounts  for  our  findings  and  is 
consistent  with  the  available  evidence  is  illustrated  in 
Figure  6.  In  this  model,  Umplp  has  a  chaperone-like 
function  required  for  the  efficient  processing  of  the  pro¬ 
peptides  of  p  subunits  upon  the  assembly  of  20S  protea- 
somes.  Specifically,  Umpl  p  interacts  with  these  p  sub¬ 
units,  most  likely  with  their  propeptides  (structure  A  in 
Figure  6).  Upon  formation  of  the  20S  proteasome  (Figure 
6,  step  1),  a  conformational  change  of  Umplp,  or  a 
change  of  its  position  within  the  complex,  would  induce 
a  conformational  change  of  the  propeptide  that  results 
in  autocatalytic  processing  (Figure  6,  step  2). 

Why  is  the  propeptide  of  Pre2p  essential  for  protea¬ 
some  assembly  and  activation  only  in  the  presence  of 
Umplp?  We  propose  that  the  propeptide  of  Pre2p  is 
required  to  induce  an  alteration  of  conformation  or  posi¬ 
tion  of  Umplp  upon  the  assembly  of  the  20S  particle. 
In  this  model,  the  absence  of  the  propeptide  would  leave 
Umpl  p  in  a  position  that  is  incompatible  with  the  forma¬ 
tion  of  active  proteasome.  It  is  possible  (and  remains 
to  be  verified)  that  this  chaperone-like  function  of  a  pro¬ 
peptide  is  unique  to  Pre2p,  as  it  contains  a  much  longer 
propeptide  (75  residues)  than  the  other  p  subunits. 

In  the  model  of  Figure  6,  Umplp  is  a  metabolically 


unstable  chaperone  that  is  required  for  the  proper  (and 
properly  timed)  processing  of  p  subunits  upon  the  as¬ 
sembly  of  the  20S  proteasome  and  that  is  destroyed 
within  the  newly  activated  proteasome.  The  unusual  me¬ 
chanics  of  Umpl  p,  its  noncatalytic  mode  of  action,  and 
its  degradation  by  the  protease  it  helps  to  activate  char¬ 
acterize  Umpl  p  as  a  novel  type  of  molecular  chaperone. 

Experimental  Procedures 
Yeast  Media 

Yeast  rich  (YPD)  and  synthetic  (S)  minimal  media  with  2%  dextrose 
(SD)  or  2%  galactose  (SG)  were  prepared  as  described  (Dohmen  et 
al.,  1995). 

Isolation  of  the  UMP1  Gene 

UMP1  was  identified  using  a  selection-based  screen  for  mutants  in 
the  N-end  rule  pathway  (reviewed  by  Varshavsky,  1996).  S.  cerevis- 
iae  strain  YPH500  (Table  1)  was  transformed  with  two  plasmids: 
pJD205,  which  expressed  a  Ura3p-based  N-end  rule  reporter  sub¬ 
strate  (Arg-Tpil  p-Ura3p),  and  pRL2,  which  expressed  Arg-p-gal, 
another  N-end  rule  reporter  substrate  that  can  be  monitored  using 
X-Gal  plate  assays  (Baker  and  Varshavsky,  1995).  Previous  work 
(Dohmen  et  al.,  1994)  has  shown  that  rapid  degradation  of  a  Ura3- 
based  N-end  rule  substrate  renders  the  cells  Ura~,  whereas  mutants 
in  the  N-end  rule  pathway  that  express  the  same  Ura3p-based  re¬ 
porter  are  Ura+.  In  this  screen,  Ura+  isolates  were  selected  on  SD 
media  lacking  uracil  and  were  then  tested  on  X-Gal  plates  to  verify 
that  the  same  isolates  were  also  defective  in  the  degradation  of  the 
Arg-p-gal  N-end  rule  substrate.  One  of  the  mutants  thus  identified 
(umpl-1)  was  cured  of  pJD205  and  transformed  with  a  genomic 
yeast  library  (Rose  et  a!.,  1987).  Six  transformants  yielded  plasmids 
with  four  overlapping  inserts  that  could  restore  the  ability  of  cells 
to  degrade  Arg-p-gal.  A  1981  bp  BamHI/Sau3A  fragment  common 


to  all  of  the  inserts  was  sequenced  and  found  to  contain  one  com¬ 
plete  ORF  and  two  flanking  incomplete  ORFs.  Further  mapping, 
using  subcloning  and  PCR,  confirmed  that  the  complete  ORF  of  445 
bp  was  responsible  for  the  complementation.  The  UMP1  sequence 
(EMBL  database  accession  number:  AJ002557)  is  identical  to  ORF 
YBR173C,  an  ORF  subsequently  identified  by  the  yeast  genome 
project. 

Construction  of  Yeast  Strains  and  Plasmids 
Table  1  lists  the  strains  used  in  this  study.  To  construct  umpl  A 
alleles,  the  1981  bp  fragment  described  above  was  subcloned  into 
M13mp19.  Using  single-stranded  DNA  of  the  resulting  phage,  a 
synthetic  oligonucleotide  and  T4  DNA  polymerase,  the  UMP1  ORF 
was  precisely  deleted  and  replaced  by  a  Bglll  restriction  site.  The 
resulting  fragment  was  subcloned  into  pUC19,  and  the  Bglll  site 
was  used  to  insert  fragments  containing  the  HIS3,  LEU2 ,  or  the 
URA3  gene,  the  latter  one  being  flanked  by  two  direct  repeats  of  a 
segment  of  the  £  coli  hisG  gene  (Alani  et  al.,  1987).  The  resulting 
deletion  alleles  (ump1-M::HI$3,  ump1-&2::LEU2 ,  and  umpl- 
A 3::URA3)  were  isolated  as  BamHI  fragments,  introduced  into  S. 
cerevisiae,  and  used  to  delete  the  UMP1  gene  (Rothstein,  1991). 
umpl A  strains  carrying  an  unmarked  ump1-&3  allele  (resulting  from 
recombination  between  the  hisG  repeats)  were  selected  on  plates 
containing  5-fluoroorotic  acid  (Alani  et  al.,  1987).  Construction  of 
chromosomal  ORFs  that  expressed  C  terminally  tagged  versions  of 
Umplp  or  proteasome  subunits  (Prelp,  Pre2p,  Pre3p,  and  Puplp) 
instead  of  their  wt  counterparts  was  performed  as  follows.  Using 
primers  that  contained  flanking  EcoRi  and  Kpnl  sites,  3'  portions 
of  the  respective  genes  were  amplified  by  PCR.  These  sites  were 
then  used  to  insert  the  amplified  fragments  into  integrative  plasmids 
based  on  Ylplacl  28  (LEU2  marked)  or  Ylplac21 1  {URA3  marked) 
(Gietz  and  Sugino,  1988)  that  contained  sequences  encoding  epi¬ 
tope  tags  followed  by  the  terminator  sequence  of  the  CYC  1  gene 
(TCycr)-  Each  of  the  resulting  plasmids  was  linearized  within  the  cod¬ 
ing  sequence  for  targeted  integration  into  the  S.  cerevisiae  genome, 
yielding  strains  with  one  copy  of  the  respective  gene  (fused  in-frame 
to  the  tag-coding  sequence)  expressed  from  its  natural  promoter 
and  Icycu  in  addition  to  a  3'  portion  of  the  same  gene  without 
promoter.  The  epitope  tags  used  were  double  ha  (“ha”)  and  Fiag- 
His6  (“FH”)  (Dohmen  et  al.,  1995).  Plasmid  pJD429  ( CEN/URA3 )  ex¬ 
pressing  a  UMP1  cDNA  from  PGAL1  was  isolated  from  a  cDNA 
library  (Liu  et  al.,  1992)  through  complementation  of  the  umpl  A 
mutation. 

Pulse-Chase  Analyses 

R-p-gal  (Ub-R-p-gal)  and  Ub-P-p-gal  (Bachmair  et  al.,  1986)  were 
expressed  in  the  MATa  strains  JD47-13C  or  JD59.  MATa21-6a-p-gal 
was  expressed  from  the  plasmid  YCp50-a2deg1-p-gal  (Hochstrasser 
and  Varshavsky,  1 990)  in  the  MATa  strains  JD53  and  JD81  -1 A  (Table 
1).  Pulse  labeling  for  5  min  with  Redivue  Promix  P5S]  (Amersham) 
followed  by  a  chase  and  immunoprecipitation  with  monoclonal 
anti-p-gal  (Promega)  or  anti-ha  (16B12,  Babco)  antibodies  were 
carried  out  as  described  by  Dohmen  et  al.  (1 991 ).  proteins  frac¬ 
tionated  by  SDS-PAGE  were  detected  by  fluorography. 

Fractionation  of  Whole-Cell  Extracts  by  Gel  Filtration 
S.  cerevisiae  were  grown  at  30°C  in  YPD  or  in  SD  media  to  ODeoo  of 
1 .4  ±  0.1 ,  harvested  at  3000g,  washed  with  cold  water,  frozen  in 
liquid  nitrogen,  and  stored  at  -80°C.  Cell  paste  was  ground  to 
powder  in  a  mortar  in  the  presence  of  liquid  nitrogen.  The  extraction 
buffer  was  50  mM  Tris-HCI  (pH  7.5),  2  mM  ATP,  5  mM  MgCl2, 1  mM 
DTT,  1 5%  glycerol,  used  at  2  ml  per  gram  of  pelleted  yeast  cells. 
After  centrifugation  at  31,000g  for  10  min  at  2°C,  the  supernatant 
was  subjected  to  a  second  centrifugation  at  60,000g  for  30  min, 
yielding  an  extract  with  the  protein  concentration  of  ~5  mg/ml.  The 
protein  concentration  of  extracts  was  equalized  in  parallel  experi¬ 
ments,  using  the  extraction  buffer.  Using  the  FPLC  system  (Phar¬ 
macia),  200  |xl  samples  of  an  extract  were  chromatographed  on  a 
Superose-6  column  equilibrated  with  extraction  buffer.  The  flow 
rate  was  0.3  ml/min  and  fractions  of  0.6  ml  were  collected.  The 
Superose-6  column  was  calibrated  using  the  following  standards: 
thyroglobulin  (669  kDa),  ferritin  (443  kDa),  and  bovine  serum  albumin 
(66  kDa).  Dextran  blue  was  used  to  monitor  the  void  volume. 


Assays  for  Proteolytic  Activities  with  Fluorogenic 
Peptide  Substrates 

To  determine  the  chymotrypsin-like  activity,  20  *d  of  the  protein 
fraction  and  20  ^1  of  0.5  mM  succinyl-Leu-Leu-Val-Tyr-7-amido-4- 
methyicoumarin  in  50  mM  Tris-HCI  (pH  7.8),  2  mM  ATP,  5  mM  MgCI2, 
and  1  mM  DTT  were  mixed  and  incubated  for  1  hr  at  37°C.  The 
reaction  was  stopped  by  addition  of  960  jil  of  cold  ethanol,  and 
the  fluorescence  was  measured  at  440  nm,  using  the  excitation 
wavelength  of  380  nm.  The  trypsin-like  activity  and  the  peptidylglu- 
tamyl  peptide-hydrolyzing  activity  were  determined  with,  respec¬ 
tively,  Cbz-Ala-Ala-Arg-4MeO-p-naphthytamide  and  Cbz-Leu-Leu- 
Glu-p-naphthylamide  as  fluorogenic  peptide  substrates  (Fischer  et 
al.,  1994),  except  that  the  volume  of  the  protein  fraction  was  75 
the  reactions  were  stopped  after  270  min  at  37°C,  and  the  fluores¬ 
cence  emission  was  measured  at  366  nm,  using  an  excitation  wave¬ 
length  of  420  nm.  One  unit  (U)  of  a  proteolytic  activity  is  defined  as 
1  jimol  of  the  fluorophore  produced  per  min  under  these  conditions. 

Immunoprecipitation,  Electrophoresis,  and  Immunoblotting 
Immunoprecipitations  were  carried  out  using  either  monoclonal  anti¬ 
tag  antibodies  (see  below),  or  polyclonal  rabbit  antisera  raised 
against  the  yeast  20S  proteasome  (a  gift  from  K.  Tanaka),  or  against 
Ump1p-His6  expressed  in  E.  coii.  For  immunoblotting,  the  protein 
samples  were  boiled  for  3  min  in  the  presence  of  2%  SDS  and 
0.1  M  2-mercaptoethanol,  then  subjected  to  1 2%  SDS-PAGE,  and 
thereafter  transferred  onto  a  PVDF  membrane  (Millipore)  in  a  wet 
blot  system  (BioRad).  The  blots  were  incubated  with  either  rabbit 
anti-ubiquitin  (Ramos  et  al.,  1995),  or  anti-Cim3p  (Ghislain  et  al., 
1993)  polyclonal  antibodies,  or  16B12  anti-ha  (BabCo),  or  M2  anti- 
Flag  (Kodak)  monoclonal  antibodies,  and  were  processed  as  de¬ 
scribed  (Ramos  et  al.,  1 995),  except  that  the  initially  blotted  proteins 
were  visualized  using  horseradish  peroxidase-conjugated  goat  anti¬ 
mouse  or  anti-rabbit  IgGs,  the  chemiluminescence  blotting  sub¬ 
strate  detection  system  from  Boehringer  Mannheim,  and  X-ray  films. 
Nondenaturing  4.5%  acrylamide  gel  electrophoresis  was  performed 
as  described  by  Hough  et  al.  (1 987),  and  the  gels  were  incubated  for 
1 5  min  in  transfer  buffer  containing  0.1  %  SDS  before  electrotransfer. 

Affinity  Purification  of  Proteasomes  and  Related  Complexes 
The  20S  proteasome  and  its  precursors  were  purified  from  strain 
JD126  that  expressed  Prelp-FH  and  Umptp-ha.  Crude  extracts 
were  prepared  as  described  above,  followed  by  an  exchange  of 
buffer  to  50  mM  Na-phosphate  (pH  8.0),  0.3  M  NaCI,  using  PD-1 0 
columns  (Pharmacia).  The  extract  was  then  incubated  with  Ni-NTA 
agarose  (Qiagen)  in  batch  for  2  hr  at  4°C,  followed  by  washings  and 
elution  according  to  the  manufacturer's  protocol,  except  that  the 
(pH  6.0)  washing  buffer  contained  20  mM  imidazole.  His6-tagged 
proteins  were  eluted  with  a  step  gradient  of  100-500  mM  imidazole. 
Active  fractions  containing  a  mixture  of  the  mature  proteasome  and 
its  precursors  were  pooled,  diluted  2-fold  with  50  mM  Tris-HCI  (pH 
7.5),  and  incubated  for  2  hr  at  4°C  in  batch  with  0.5  ml  anti-Flag  M2 
antibody  affinity  resin  (IBI/Eastman  Kodak)  that  had  been  equili¬ 
brated  in  TN  buffer  (0.15  M  NaCI,  50  mM  Tris-HCI,  [pH  7.5]).  After 
the  loading,  the  resin  was  washed  with  TN  buffer.  Flag-tagged  pro¬ 
teins  were  specifically  eluted  with  the  Rag  epitope  peptide  (IBI/ 
Eastman  Kodak). 
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